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DRUG TARGETS IN CANDIDA ALBICANS 

The present invention is concerned wi^ the 
identification of genes or functional fragments 
thereof from Candida albicans which are critical for 
growth and cell division and which genes may be used 
as selective drug targets to treat Candida albicans 
associated infections. Novel nucleic acid sequences 
from Candida albicans are also provided and which 
encode the polypeptides which are critical for growth 
of Candida albicans. 

Opportunistic infections in immunocompromised 
hosts represent an increasingly common cause of 
mortality and morbidity. Candida species are among 
the most commonly identified fungal pathogens 
associated with such opportunistic infections, with 
Candida albicans being the most common species. Such 
fungal infections are thus problematical in, for 
example, AIDS populations in addition to normal 
healthy women where Candida albicans yeasts represent 
the most common cause of vulvovaginitis. 

Although compounds do exist for treating such 
disorders, such as, amphotericin, these drugs are 
generally limited in their treatment because of their 
25 toxicity and side effects. Therefore, there exists a 
need for new compounds which may be used to treat 
Candida associated infections in addition to compounds 
which are selective in their action against Candida 
albicans. 

Classical approaches for identifying anti-fungal 
compounds have relied almost exclusively on inhibition 
of fungal or yeast growth as an endpoint. Libraries 
of natural products, semi-synthetic, or synthetic 
chemicals are screened for their ability to kill or 
arrest growth of the target pathogen or a related 
nonpathogenic model organism. These tests are 
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cumbersome and provide no information about a 
compound's mechanism of action. The promising lead 
compounds that emerge from such screens muat., then be 
tested for possible host-toxicity and detailed 
mechanism of action studies must subsequently be 
conducted to identify the affected molecular target. 

The present inventors have now identified a range 
of nucleic acid sequences from Candida albicans which 
encode polypeptides which are critical for its 
survival and growth. These sequences represent novel 
targets which can be incorporated into an assay to 
selectively identify compounds capable of inhibiting 
expression of such polypeptides and their potential 
use in alleviating diseases or conditions associated 
15 with Candida albicans infection. 

Theref' ..e, according to a first aspect of the 
invention there is provided a nucleic acid molecule 
encoding a polypeptide which is critical for survival 
and growth of the yeast Candida albicans and which 
nucleic acid molecule comprises any of the sequences 
of nucleotides illustrated in any of Sequence ID Nos. 
1 to 9. 

Whilst the molecules defined herein have been 
established as being critical for growth and 
25 metabolism of Candida albicans, for some of the 

molecules no apparent functionality has been assigned 
by virtue of the fact that no functionally related 
sequences in other prokaryotic or eukaryotic organism 
can be found in respective databases. Thus, 
advantageously these sequences may be species specific 
in which case they may be used be used as selective 
targets for treatment of diseases mediated by Candida 
Albicans infection. Thus, in one aspect of the 
invention the nucleic acid molecules preferably 
comprise the sequences identified in sequence ID Nos. 
1, 4, 5 to 9. 
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In another aspect of the invention the sequences 
have been arranged functionally and of nucleotides 
illustrated in Sequence ID Nos. 2 or 3 are .preferred 
and even more preferably in Sequence ID No. 2 and 
fragments or derivatives of said nucleic acid 
molecules. 

Letters utilised in the sequences according to 
the invention which are not recognisable as letters of 
the genetic code signify a position in the nucleic 
acid sequence where one or more of bases- A, G, C or T 
can occupy the nucleotide position. Representative 
letters used to identify the range of bases which can 
be used are as follows: 



M: 


A 


or 


C 






R; 


A 


or 


G 






W: 


A 


or 


T 






S: 


C 


or 


G 






Y: 


C 


or 


T 






K: 


G 


or 


T 






V: 


A 


or 


C 


or 


G 


H: 


A 


or 


c 


or 


T 


D:- 


A 


or 


G 


or 


T 


B: 


C 


or 


G 


or 


T 


N: 


G 


or 


A 


or 


T or C 



25 



In one embodiment of each of the above identified 
aspects of the invention the nucleic acid may comprise 
a mRNA molecule or alternatively a DNA and preferably 
30 a cDNA molecule. 

Also provided by the present invention is a 
nucleic acid molecule capable of hybridising to the 
nucleic acid molecules illustrated in any of Figures 1 
to 9 under high stringency conditions such as 
35 antisense molecule and which conditions are generally 
known to those of skill in the art. 

Stringency of hybridisation as used herein refers 
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to conditions under which polynucleic acids are 
stable. The stability of hybrids is reflected in the 
melting temperature (Tm) of the hybrids. Tm_can be 
approximated by the formula: 

5 

81.5'C+16.6(Logio[Na*)+0.41 (%GSC)-600L/L 

wherein L is the length of the hybrids in nucleotides. 
Tm decreases approximately by 1-1. S^C with every 1% 
ID decrease in sequence homology. 

The term "stringency" refers to the hybridisation 
conditions wherein a single-stranded nucleic acid 
joins with a complementary strand when the purine or 
pyramidine bases therein pair with their corresponding 
base by hydrogen bonding. High stringency conditions 
fa- our homologous base pairing whereas low stringency 
conditions disfavour non-homologous base pairing. 

"Low s-ringency" conditions comprise, for 
example, a temperature of about 37 °C or less, a 
formamide concentration of less than about 50%, and a 
moderate to low salt (SSC) concentration; or, 
alternatively, a temperature of about SCC or less, 
and a moderate t6 high salt (SSPE) concentration, for 
example IM NaCl. 

"High stringency" conditions comprise, for 
example, a temperature of about 42''C or less, a 
formamide concentration of less than about 20%, and a 
low salt (SSC) concentration; or, alternatively, a 
temperature of about 65 "C, or less, and a low salt 
(SSPE) concentration. For example, high stringency 
conditions comprise hybridization in 0.5 M NaHPO^, 7% 
sodium dodecyl sulfate (SDS), 1 mM EDTA at eS'C 
(Ausubel, F.M. et al. Current Prn^ ocol.c, M^i.^.n.^ 
Biology , Vol. I, 1989; Green Inc. New Yorlc, at 
35 2.10.3). 

"SSC" comprises a hybridization and wash 
solution. A stock 20X SSC solution contains 3M sodium 
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chloride, 0.3M sodium citrate, pH 7.0. 

"SSPE" comprises a hybridization and wash 
solution. A IX SSPE solution contains 180_mM NaCl, 
9mM NajHPO, and 1 mM EDTA, pH 7.4. 
5 The nucleic acid capable of hybridising to 

nucleic acid molecules according to the invention will 
generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95% homologous to the 
nucleotide sequences illustrated in any of Figures 1 
10 to 9. 

T.he DNA molecules according to the invention may, 
advantageously, be included in a suitable expression 
vector to express polypeptides encoded therefrom in a 
suitable host which are critical for growth and 
15 survival of Candida albicans. 

An expression vector according to t:.- invention 
includes a vector having a nucleic acid according to 
the invention operably linked to regulatory sequences, 
such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 
"operably linked" refers to a juxtaposition wherein 
the components described are in a relationship 
permitting them to function in their intended manner. 
Such vectors may "be transformed into a suitable host 
25 cell to provide for expression of a polypeptide 
according to the invention. Thus, in a further 
aspect, the invention provides a process for preparing 
polypeptides according to the invention which 
comprises cultivating a host cell, transformed, 
30 transfected or infected with an expression vector as 
described above under conditions to provide for 
expression by the vector of a coding sequence encoding 
the polypeptides, and recovering the expressed 
polypeptides. 

35 The vectors may be, for example, plasmid, virus 

or phage vectors provided with an origin of 
replication, optionally a promoter for the expression 
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of said nucleotide and optionally a regulator of the 
promoter. The vectors may contain one or more 
selectable markers, such as, for example, ampicillin 
resistance. ~ 

Regulatory elements required for expression 
include promoter sequences to bind RNA polymerase and 
transcription initiation sequences for ribosome 
binding. For example, a bacterial expression vector 
may include a promoter such as the lac promoter and 
for translation initiation the Shine-D^.lgarno sequence 
and the start codon AUG. Similaiiy, a eukaryotic 
expression vector may include a heterologous or 
homologous promoter for RNA polymerase II, a 
downstream polyadenylation signal, the start codon 
15 AUG, and a termination codon for detachment of the 

ribosome. Such vectors may be obtained commercially 
or assembled from the sequences described by methods 
well known in the art. 

Polynucleotides according to the invention may be 
inserted into the vectors described in an antisense 
orientation in order to provide for the production of 
antisense RNA. Antisense RNA or other antisense 
nucleic acids may be produced by synthetic means. 

In accordance with the present invention, a 
defined nucleic acid includes not only the identical 
nucleic acid but also any minor base variations 
including in particular, substitutions in bases which 
result in a synonymous codon (a different codon 
specifying the same amino acid residue) due to the 
degenerate code. The term "nucleic acid sequence" 
also includes the complementary sequence to any single 
stranded sequence given regarding base variations. 

The present invention also comprises within its 
scope proteins or polypeptides expressed by the 
nucleic acid molecules according to the invention or a 
functional equivalent, derivative or bioprecursor 
thereof. 
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The present invention also advantageously 
provides nucleic acid sequences of at least 
approximately 10 contiguous nucleotides of nucleic 
acid according to the invention and preferably from 10 
to approximately 120 nucleotides. In another aspect 
of the invention, nucleotide acid sequences are 
provided from 10 to 50 nucleotides. These sequences 
may, advantageously be used as probes or primers to 
initiate replication, or the like. Such nucleic acid 
sequences may be produced according to techniques well 
known in the art, such as by recombinant or synthetic 
means. They may also be used in diagnostic kits or 
the like for detecting the presence of a nucleic acid 
according to the invention. These tests generally 
comprise contacting the probe with the sample under 
hybridising conditions and detrr.ting for the presence 
of any duplex or triplex formation between the probe 
and any nucleic acid in the sample. 

According to the present invention, these probes 
may be anchored to a solid support. Preferably, they 
are present on an array so that multiple probes can 
simultaneously hybridize to a single biological 
sample. The probes can be spotted onto the array or 
synthesized in situ on the array. See Lockhart et a2.. 
Nature Biotechnology, Vol. 14, December 1996, 
"Expression monitoring by hybridization to high- 
density oligonucleotide arrays." A single array can 
contain more than up to more than a million different 
probes in discrete locations. 

Advantageously, the nucleic acid sequences, 
according to the invention may be produced using such 
recombinant or synthetic means, such as for example 
using PGR cloning mechanisms which generally involve 
making a pair of primers, which may be between 
approximately 10 to 120 nucleotides to a region of the 
gene which is desired to be cloned, bringing the 
primers into contact with mRNA, cDNA, or genomic DNA 
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from a cell, performing a polymerase chain reaction 
under conditions which bring about amplification of 
the desired region, isolated the amplified .jrsgion or 
fragment and recovering the amplified DNA. Generally, 
such techniques as defined herein are well known in 
the art, such as described in Sambrook et ai 
(Molecular Cloning: a Laboratory Manual, 1989). 

The nucleic acids or oligonucleotides according 
to the invention may carry a revealing label. 
Suitable labels include radioisotopes such as «p or 
''3, enzyme labels or other protein labels such as 
biotin or fluorescent markers. Such labels may be 
added to the nucleic acids or oligonucleotides of the 
invention and may be detected using known techniques 
15 per se. 

The polypeptide or protein according to the 
invention includes all possible amino acid variants 
encoded by the nucleic acid molecule according to the 
invention including a polypeptide encoded by said 
molecule and having conservative amino acid changes. 
Polypeptides according to the invention further 
include variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said polypeptides. in 
this context, substantial homology is regarded as a 
sequence which has at least 70%, preferably 80 or 90% 
amino acid homology with the polypeptides encoded by 
the nucleic acid molecules according to the invention. 

Nucleic acids and polypeptides which are 
particularly preferred are those comprising the 
sequences of nucleotides illustrated in figures 1 to 3 
and polypeptides illustrated in figures 14 to 16. 
However, a particularly preferred nucleic acid 
comprises the sequences of nucleotides illustrated in 
Figures 2 and/or 3, and their corresponding amino acid 
sequences identified in Figures 15 and 16. 

Nucleotide sequences according to the invention 
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are particularly advantageous as selective therapeutic 
targets for treating Candida albicans associated 
infections. For example, an antisense nuqleic acid 
capable of binding to the nucleic acid sequence 
illustrated in any of Figures 1 to 9 may be used to 
selectively inhibit expression of the corresponding 
polypeptides, leading to impaired growth of the 
Candida albicans with reductions of associated 
illnesses cr diseases. 

The nucleic acid molecule or the polypeptide 
according to the invention may be used as a 
medicament, or in the preparation of a medicament, for. 
treating diseases or conditions associated with 
Candida albicans infection. 

Advantageously, the nucleic acid molecule or the 
polypeptide accord: ag to the invention may be provided 
in a pharmaceutical composition together with a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor. 

The present invention is further directed to 
inhibiting expression of nucleic acids according to 
the invention ,in vivo by the use of antisense 
technology. Antisense technology can be used to 
control gene expression through triple-helix formation 
25 of antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. 
For example, the 5' coding portion or the mature 
protein sequence, which encodes for the protein of the 
present invention, is used to design an antisense RNA 
oligonucleotide of from 10 to 50 base pairs in length. 
A DNA oligcrucleotide is designed to be complementary 
to a region of the gene involved in transcription 
(triple-helix - see Lee et ai. Nucl. Acids Res., 
6:3073 (1979); Cooney et ai.. Science, 241:456 (1988); 
and Dervan et ai., Science, 251: 1360 (1991), thereby 
preventing .ranscription and the production of the 
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corresponding protein. The antisense RNA 
oligonucleotide hybridises to the mRNA in vivo and 
blocks translation of an mRNA molecule inttrthe 
corresponding protein (antisense - Okano, J. 
Neurochem., 56:560 (1991); Oligodeoxynucleotides as 
Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). 

Antibodies to the protein or polypeptide of the 
present invention may, advantageously, be prepared by 
techniques which are known in the art. tor example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse, with the polypeptide 
according to the invention or an epitope thereof and 
recovering immune serum. Monoclonal antibodies may be 
15 prepared according to known techniques such as 

described by Kohler R.- and Milstein C.,. Nature (1975) 
256, 495-497. 

Antibodies according to the invention may also be 
used in a method of detecting for the presence of a 

20 polypeptide according to the invention, which method 
comprises reacting the antibody with a sample and 
identifying any protein bound to said antibody. A kit 
may also be provided for performing said method which 
comprises an antibody according to the invention and 

25 means for reacting the antibody with said sample. 

Proteins which interact with the polypeptide of 
the invention may be identified by investigating 
protein-protein interactions using the two-hybrid 
vector system first proposed by Chien et ai. (1991). 

This technique is based on functional 
reconstitution in vivo of a transcription factor which 
activates a reporter gene. More particularly the 
technique comprises providing an appropriate host cell 
with a DNA construct comprising a reporter gene under 

35 the control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 
domain, expressing in the host cell a first hybrid DNA 
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sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said -activating 
domain of the transcription factor, expressing in the 
5 host at least one second hybrid DNA sequence, such as 
a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 

10 detecting any binding of the proteins to be 

investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
product in the host cell; optionally isolating second 
hybrid DNA sequences encoding the binding protein , 

15 An example of such a technique utilises the GAL4 

protein in yeast. GAL4 is a transcriptional activator 
of galactose metabolism in yeast and has a separate 
domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 

20 binding domain. Nucleotide vectors may be 

constructed, one of which comprises the nucleotide 
residues encoding the DNA binding domain of GAL4 . 
These binding domain residues may be fused to a known 
protein encoding sequence, such as for example the 

25 nucleic acids according to the invention. The other 
vector comprises the residues encoding the protein 
binding domain of GAL4 . These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 

30 according to the invention and the protein to be 
tested leads to transcriptional activation of a 
reporter molecule in a GAL4 transcription deficient 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 

35 fi-galactosidase is activated upon restoration of 

transcription of the yeast galactose metabolism genes. 
Further provided by the present invention is one 
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or more Candida albicans cells comprising an induced 
mutation in the DNA sequence encoding the polypeptide 
according to the invention. --^.-r 

A further aspect of the invention provides a 
5 method of identifying compounds which selectively 
inhibit or interfere with the expression, the 
functionality of polypeptides expressed from the 
nucleotides sequences illustrated in any of Figures 1 
to 9 or the metabolic pathways in which these 

10 polypeptides are involved and which are' critical for 
growth and survival of Candida albicans, which method 
comprises (a) contacting a compound to be tested with 
one or more Candida albicans cells having a mutation 
in a nucleic acid molecule according to the invention 

15 which mutation results in overexpression or 

underexpression of said polypeptides in adaition to 
one or more wild type Candida cells, (b) monitoring 
the growth and/or activity of said mutated cell 
compared to said wild type wherein differential growth 

20 or activity of said one or more mutated Candida cells 
provides an indication of selective action of said 
compound on said polypeptide or another polypeptide in 
the same or a parallel pathway. 

Compounds identifiable or identified using the 

25 method according to the invention, may advantageously 
be used as a medicament, or in the preparation of a 
medicament to treat diseases or conditions associated 
with Candida albicans infection. These compounds may 
also advantageously be included in a pharmaceutical 

30 composition together with a pharmaceutically 

acceptable carrier, diluent or excipient therefor. 

A further aspect of the invention provides a 
method of identifying DNA sequences from a cell or 
organism which DNA encodes polypeptides which are 

35 critical for growth or survival, which method 

comprises (a) preparing a cDNA or genomic library from 
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said cell or organism in a suitable expression vector 
which vector is such that it can either integrate into 
the genome in said cell or that it permits -^.-r 
transcription of antisense RNA from the nucleotide 

5 sequences in said cDNA or genomic library, (b) 

selecting transf ormants exhibiting impaired growth and 
determining the nucleotide sequence of the cDNA or 
genomic sequence from the library included in the 
vector from said transf ormant , Preferably, the cell 

0 or organism may be any yeast or filamentous fungus, 
such as, for example, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe or Candida albicans . 

A further aspect of the invention provides a 
pharmaceutical composition comprising any of a 

5 compound, an antisense molecule or an antibody 
according to the invention together with a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor. 

The antisense molecules or indeed the compounds 

3 identified as agonists or antagonists of the nucleic 
acids or polypeptides according to the invention may 
be used in the -form of a pharmaceutical composition, 
which may be prepared according to procedures well 
known in the art. Preferred compositions include a 

j pharmaceutically acceptable vehicle or diluent or 

excipient, such as for example, a physiological saline 
solution. Other pharmaceutically acceptable carriers 
including other non-toxic salts, sterile water or the 
like may also be used. A suitable buffer may also be 

) present allowing the compositions to be lyophilized 
and stored in sterile conditions prior to 
reconstitution by the addition of sterile water for 
subsequent administration. Incorporation of the 
polypeptides of the invention into a solid or semi- 

• solid biologically compatible matrix may be carried 
out which can be implanted into tissues requiring 
treatment . 
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The carrier can also contain other 
pharmaceutically acceptable excipients for modifying 
other conditions such as pH, osmolarity, vtOTosity, 
sterility, lipophilicity, solubility or the like. 
5 Pharmaceutically acceptable excipients which permit 

sustained or delayed release following administration 
may also be included. 

The polypeptides, the nucleic acid molecules or 
compounds according to the invention may be 
10 administered orally. In this embodiment they may be 
encapsulated and combined with suitable carriers in 
solid dosage forms which would be well known to those 
skilled in the art. 

As would be well known to those of skill in the 
15 art, the specific dosage regime r.ay be calculated 

according to the body surface area of the patient or 
the volume of body space to be occupied, dependent 
upon the particular route of administration to be 
used. The amount of the composition actually 
20 administered will, however, be determined by a medical 
practitioner, based on the circumstances pertaining to 
the disorder to be treated, such as the severity of 
the symptor:s, the composition to be administered, the 
age, weigh-, and response of the individual patient 
25 and the chosen route of administration- 

The present invention may be more clearly 
understood with reference to the accompanying example, 
which is purely exemplary, with reference to the 
accompanying drawings, wherein 

30 

Figures 1 and 2: are nucleotide sequences isolated 

from Candida albicans and which 
have an identified function based 
on sequence homology with proteins 
from other organisms and which 
sequences are not present in the 
public domain. 
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Figures 3 



is a nucleotide sequence isolated 
from Candida albicans and which 
has an identified funct^ion based 
on sequence homology with proteins 
from other organisms and which 
sequence is partially present in 
the public domain. 



10 



Figures 4 



is a nucleotide sequence of 
previously unknown function 
isolated from Candida albicans and 
which is partially present in the 
public domain. 



15 



Figures 5 to 9 



are nucleotide sequences of 
previously unknown function 
isolated from Candida albicans. 



20 



Figure 10 



is a diagrammatic representation 
of plasmid pGALlPNiST-1 . 



Figure 11 



is a nucleotide sequence of 
plasmid pGALlPNiST-1 of Figure 10, 



25 



Figure 12 



is a diagrammatic representation 
of plasmid pGALlPSiST-1 . 



30 



35 



Figure 13 



Figures 14 to 20: 



Figures 21 to 27: 



is a nucleotide sequence of 
plasmid pGALlPSiST-1 of Figure 12. 

are amino acid sequences of the 
appropriately corresponding DNA 
sequences illustrated in Figures 1 
to 9 with reference to Table 1. 

are growth curves of Candida 
albicans strains showing antisense 
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induced reduction in growth. 



Figures 28 to 31: are growth curves of C&Ftdlda 

albicans strains including knock- 
outs in the relevant gene 
identified. 

Example 1 

Identification of novel drug targets in C, 
albicans by anti-sense and disruptive integration 

The principle of the approach is based on the 
fact that when a particular C. albicans ruRNA is 
inhibited by producing the complementary anti-sense 
RNA, the corresponding protein will decrease. If this 
15 protein is critical for growth or survival, the cell 
producing the anti-sense RNA will grow more slowly or 
will die. 

Since anti-sense inhibition occurs at mRNA level, 
the gene copy number is irrelevant, thus allowing 
20 applications of the strategy even in diploid 
organisms , 

Anti-sense RNA is endogenously produced from an 
integrative or episomal plasmid with an inducible 
promoter; induction of the promoter leads to the 

25 production of an RNA encoded by the insert of the 

plasmid. This insert will differ from one plasmid to 
another in the library. The inserts will be derived 
from genomic DNA fragments or from cDNA to cover-to 
the extent possible- the entire genome. 

'^^^ vector is a proprietary vector allowing 
integration by homologous recombination at either the 
homologous insert or promoter sequence in the Candida 
genome. After introducing plasmids from cDNA or 
genomic libraries into C. albicans, transf ormants are 

35 screened for impaired growth after promoter (& thus 
anti-sense) induction in the presence of lithium 
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acetate. Lithium acetate prolongs the Gl phase and 
thus allows anti-sense to act during a prolonged 
period of time during the cell cycle. Trans^Crmants 
which show impaired growth in both induced and non- 
5 induced media, thus showing a growth defect due to 
integrative disruption, are selected as well. 

Transformants showing impaired growth are 
supposed to contain plasmids which produce anti-sense 
RNA to mRNAs critical for growth or survival. Growth 

10 is monitored by measuring growth-curves "over a period 
of time in a device (Bioscreen Analyzer, Labsystems) 
which allows simultaneous measurement of growth-curves 
of 200 transformants. 

Subsequently plasmids can be recovered from the 

15 transformants and the sequence of their inserts 

determined, thus revealing which mRNA they inhibit. In 
order to be able to recover the genomic or cDNA insert 
which has integrated into the Candida genome, genomic 
DNA is isolated, cut with an enzyme which cuts only 

20 once into the library vector (and estimated approx. 

every 4096 bp in the genome) and religated. PGR with 
primers flankihg^ the insert will yield (partial) 
genomic or cDNA inserts as PGR fragments which can 
directly be sequenced. This PGR analysis (on ligation 

25 reaction) will also show us how many integrations 
occurred. Alternatively the ligation reaction is 
transformed to E. coli and PGR analysis is performed 
on colonies or on plasmid DNA derived thereof. 

This method is employed for a genome-wide search 

30 for novel C. albicans genes which are important for 
growth or survival. 

Materials & M^-hVirtHg 
Construction of pGallPNiST-1 
35 The backbone of the pGALlPNiST-1 vector 

(integrative anti-sense Sfil-Notl vector) is 



wo 00/34481 



PCT/EP99y09833 



- 18 - 

pGEM112f(+) (Promega Inc.). First, the CaMAL2 
EcoRl/Sall promoter fragment from pDBVSO (D.H. Brown 
et al. 1996) was ligated into EcoRI/ Sail -opened 
pGEMllZf (+) resulting in the intermediate construct 
5 PGEMMAL2P-1. Into the latter (AfscI/CIP) the CaUPA3 
selection marker was cloned as a £co47III/XmnI 
fragment derived from pRM2. The resulting pGEMMAL2P-2 
vector was Notl/Hindlll opened in order to accept the 
Wotl-stuf fer-sni cassette from pPCKlNiSCYCT-1 
10 (Eagl/Hindlll fragment): pMAL2PNiST-l . Finally, the 

plasmid pGALlPNiST-1 was constructed by exchanging the 
SaiI/£:cil36IIMAL2 promoter in pMAL2PNiST-l by the 
Xhol/Smal GALl promoter fragment derived from 
PRM2GAL1P. 

15 

Construction of pGallPSiST-1 

The vector pGALlPSiST-1 was created for cloning 
the small genomic DNA fragments (flanked by Sfil 
sites) behind the GALl promoter. The only difference 

20 with pGALlPNiST-1 is that the hlFNp (stuffer fragment) 
insert fragment in pGALlPSiST-1 is flanked by two Sfil 
sites in stead "of a Sfil and a A^otl site as in 
pGALlPNiST-1. To construct pGALlPSiST-1 the £coRI- 
Hindlll fragment, containing hlFNP flanked by a Sfil 

25 and a Notl site, of pMAL2pHiET-3 (unpublished) was 
exchanged by the EcoRI-Hindlll fragment, containing 
hlFNP flanked by two Sfil sites, from YCp50S-S (an E. 
coli / S. cerevisiae shuttle vector derived from the 
plasmid YCp50, which is deposited in the ATCC 

30 collection (number 37419; Thrash et ai., 1985); an 
EcoRI-Hindlll fragment, containing the gene hlFNP, 
which is flanked by two Sfil sites, was inserted in 
YCp50, creating YCp50S-S) , resulting into plasmid 
pMAL2PSiST-l . The MAL2 promoter from pMAL2PSiST-l (by 

35 a Nael-ball digest) was further replaced by the GALl 
promoter from pGALlPNiST-1 (via a Xhol-FSPl digest), 
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creating the vector pGALlPSiST-1 . 

Candida alhlcans genomic library ^^.-r 

* Preparation of the genomic DMA fragments 
5 A Candida albicans genomic DNA library with small DNA 
fragments (400 to 1,000 bp) was prepared. Genomic DNA 
of Candida albicans B2630 was isolated following a 
modified protocol of Blin and Stafford (1976) . The 
quality of the isolated genomic DNA was checked by gel 

10 electrophoresis. Undigested DNA was located on the gel 
above the marker band of 26,282 bp. A little smear, 
caused by fragmentation of the DNA, was present. 
To obtain enrichment for genomic DNA fragments of the 
desired size, the genomic DNA was partially digested. 

15 Several restriction enzymes (Aiul, Haelll and Rsal; 
all creating blunt ends) were tried out. The 
appropriate digest conditions have been determined by 
titration of the enzyme. Enrichment of small DNA 
fragments was obtained with 70 units of AIuI on 10 yg 

20 of genomic DNA for 20 min. T4 DNA polymerase 

(Boehringer) and dNTPs (Boehringer) were added to 
polish the DNA ends. After extraction with phenol- 
chloroform the digest was size-fractionated on an 
agarose gel. The genomic DNA fragments with a length 

25 of 500 to 1,250 bp were eluted from the gel by 
centrifugal filtration (Zhu et al., 1985), Sfil 
adaptors (5' GTTGGCCTTTT) or (5' AGGCCAAC) were 
attached to the DNA ends (blunt) to facilitate cloning 
of the fragments into the vector. Therefore, a 8-mer 

30 and 11-mer oligonucleotide (comprising the Sfil site) 
were kinatesed and annealed. After ligation of these 
adaptors to the DNA fragments a second size- 
fractionation was performed on an agarose gel. The 
DNA fragments of 400 to 1150 bp were eluted from the 

35 gel by centrifugal filtration. 

* Preparation of the pGALlPSiST-l vector fragment 
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The small genomic DNA fragments were cloned after 
the GALl promoter in the vector pGALlPSIST-1 . Qiagen- 
purified pGALlPSiST-1 plasmid DNA was diges^-«d with 
Sfil and the largest vector fragment eluted from the 
gel by centrifugal filtration (Zhu et al., 1985). 
Ligation with a control DNA fragment, flanked by Sfil 
sites, was performed as a control. The ligation mix 
was electroporated to MC1061 E. coli cells. Plasmid 
DNA of 24 clones was analyzed. In all cases the 
control fragment was inserted in the pGALlPSiST-1 
vector fragment. 

* Upscaling 

All genomic DNA fragments (450 ng) were ligated 
into the pGALlPSiST-1 vector (20 ng) . After 
^lectroporation at 2500V, 40uF circa 400,000 clones 
were obtained. These clones were pooled into three 
groups and stored as glycerol slants. Also Qiagen- 
purified DNA was prepared from these clones. A clone 
analysis showed an average insert length of 600 bp and 
a percentage of 91 for clones with an insert. The size 
of the library corresponds to 5 times the diploid 
genome. The genomic DNA inserts are sense or anti- 
sense orientated- in the vector. 



25 Candida albicans cDNA library 

Total RNA was extracted from Candida albicans 
B2630 grown on respectively minimal (SD) and rich 
(YPD) medium as described by Chirgwin et al. in 
Sambrook et al 1996. mRNA was prepared from total RNA 

30 using the Invitrogen Fast Track procedure. 

First strand cDNA is synthesised with the 
Superscript Reverse Transcriptase (BRL) and with an 
oligo dT'Notl Primer adapter. After second strand 
synthesis, cDNA is polished with Klenow enzyme and 

35 purified over a Sephacryl S-400 spun column. 

Phosphorylared Sfil adapters are then ligated to the 
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cDNA, followed by digestion with the NotI restriction 
enzyme. The Sfil/NotI cDNA is then purified and sized 
on a Biogel column A150M, --^-r 

First fraction contains approximately 38,720 
5 clones by transformation, the second fraction only 
1540 clones. Clone analysis: 

Fr. I: 22/24 inserts, 16 ^ 1000 bp, 4 ^ 2000 bp, 
average size: 1500 bp. 

Fr. II: 9/12 inserts, 3 ^ 1000 bp, average size: 960 
10 bp cDNA was ligated in a Notl/Sfil opened pGALlPNiST-1 
vector (anti-sense) 

Can dida transformation 

The host strain used for transformation is a C. 
15 albicans ura3 mutant, CAI-4, which conta-: ns a deletion 
in orotidine-5' -phosphate decarboxylase and was 
obtained from William Fonzi, Georgetown University 
(Fonzi and Irwin) . CAI-4 was transformed with the 
above described cDNA library or genomic library using 
the Plchia spheroplast module (Invitrogen) . Resulting 
transformants were plated on minimal medium 
supplemented with glucose (SD, 0.67% or 1.34% Yeast 
Nitrogen base w/o amino acids + 2% glucose) plates 
and incubated for 2-3 days at 30''C. 

25 

Screening for mutants 

Starter cultures were set up by inoculating each 
colony in 1 ml SD medium and incubating overnight at 
30°C and 300 rpm. Cell densities were determined using 

30 a Coulter counter (Coulter 21; Coulter electronics 

limited). 250.000 cells/ml were inoculated in 1 ml SD 
medium and cultures were incubated for 24 hours at 
30'*C and 300 rpm. Cultures were washed in minimal 
medium without glucose (S) and the pellet resuspended 

35 in 650 pi S medium. 8 ^1 of this culture is used for 
inoculating 400 pi cultures in a Honeywell-100 plate 



20 
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10 



(Bioscreen analyzer; Labsystems) . Each transf ormant 
was grown during three days in S medium containing 
LiAc; pH 6.0, with 2% glucose/2% maltose or-^ 
galactose/2% maltose respectively while shaking every 
3 minutes for 20 seconds. Optical densities were 
measured every hour during three consecutive days and 
growth curves were generated (Bioscreen analyzer; 
Labsystems) . 

Growth curves of transf ormants grown in 
respectively anti-sense non-inducing (glxicose/maltose) 
and inducing (galactose/maltose) medium are compared 
and those transf ormants showing impaired growth upon 
anti-sense induction are selected for further 
analysis. Transf ormants showing impaired growth by 
15 virtue of integration into a critical gene are also 
selected. 

Isolation of genomic or cDNA inserts 

Putatively interesting transf ormants are grown in 
1.5 ml SD overnight and genomic DNA is isolated using 
20 the Nucleon MI . Yeast kit (Clontech) . Concentration of 
genomic DNA is estimated by analyzing a sample on an 
agarose gel. 

20 ng of genomic DNA is digested for three hours 
with an enzyme that cuts uniquely in the library 
25 vector (Sad for the genomic library; PstI for the 
cDNA library) and treated with RNAse. Samples are 
phenol/chloroform extracted and precipitated using 
NaOAc/ethanol. 

The resulting pellet is resuspended in 500 pi 
ligation mixture (1 x ligation buffer and 4 units of 
T4 DNA ligase; both from Boehringer) and incubated 
overnight at 16°C. 

After denaturation (20 min 65**C) , purification 
(phenol/chloroform extraction) and precipitation 
35 (NaOAc/ethanol) the pellet is resuspended in 10 m1 
MilliQ (Millipore) water. 



30 
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PGR analysis 

Inverse PCR is performed on 1 ^1 of the 
precipitated ligation reaction using library^ector 
specific primers {oiigo23 5* TGC-AGC-TCG-ACC-TCG-ACT-G 
5 3' and oligo2 5 5' GCG-TGA-ATG-TAA-GCG-TGA-C 3* for the 
genomic library; 3pGALNistPCR primer 
:5'TGAGCAGCTCGCCGTCGCGC 3' and SpGALNistPCR primer: 
5'GAGTTATACCCTGCAGCTCGAC 3' for the cDNA library; both 
from Eurogentec) for 30 cycles each consisting of (a) 

10 1 min at 95 °C, (b) 1 min at 57 **C, and (c) 3 min at 
72 °C. In the reaction mixture 2.5 units of Taq 
polymerase (Boehringer) with TaqStart antibody 
(Clontech) (1:1) were used, and the final 
concentrations were 0.2 of each primer, 3 mM MgC12 

15 (Perkin Elmer Cetus) and 200 p'.rdNTPs {Perkin Elmer 
Cetus) . PCR was performed in a Robocycler 
(Stratagene) . 

Sequence determination 

20 Resulting /PCR products were purified using PCR 

purification kit (Qiagen) and were quantified by 
comparison of band intensity on EtBr stained agarose 
gel with the intensity of DNA marker bands. The amount 
of PCR product (expressed in ng) used in the 

25 sequencing reaction is calculated as the length of the 
PCR product in basepairs divided by 10. Sequencing 
reactions were performed using the ABI Prism Bigbye 
Terminator Cycle Sequencing Ready Reaction Kit 
according to the instructions of the manufacturer (PE 

30 Applied Biosystems, Foster City, CA) except for the 
following modifications. 

The total reaction volume was reduced to 15 pi. 
Reaction volume of individual reagents were changed 
accordingly. 6.0 pi Terminator Ready Reaction Mix was 

35 replaced by a mixture of 3.0 pi Terminator Ready 
Reaction Mix + 3.0 pi Half Term (GENPAK Limited, 
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Brighton, UK) . After cycle sequencing, reaction 
mixtures were purified over Sephadex G50 columns 
prepared on Multiscreen HV opaque microtiteiC-rplates 
(Millipore, Molsheim, Fr) and were dried in a 
5 speedVac. Reaction products were resuspended in 3 ^1 
loading buffer. Following denaturation for 2 min at 
95°C, 1 Ail of sample was applied on a 5% Long Ranger 
Gel (36 cm well-to-read) prepared from Singel Packs 
according to the supplier's instructions (FMC 

10 BioProducts, Rockland, ME) . Samples were run for 7 
hours 2X run on a ABI 377XL DNA sequencer. Data 
collection version 2.0 and Sequence analysis version 
3.0 (for basecalling) software packages are from PE 
Applied Biosystems. Resulting sequence text files 

15 were copied onto a server for further analysis. 

Sequence analysis 

Nucleotide sequences were imported in the 
VectorNTI software package (InforMax Inc, North 

20 Bethesda, MD, USA), and the vector and insert regions 
of the sequences were identified. Sequence similarity 
searches against public and commercial sequence 
databases were performed with the BLAST software 
package (Altschul et ai., 1990) version 1.4. Both the 

25 original nucleotide sequence and the six-frame 

conceptual translations of the insert region were used 
as query sequences. The used public databases were the 
EMBL nucleotide sequence database (Stoesser et ai., 
1998), the SWISS-PROT protein sequence database and 

30 its supplement TrEMBL (Bairoch and Apweiler, 1998), 
and the ALCES Candida albicans sequence database 
(Stanford University, University of Minnesota) . The 
commercial sequence databases used were the LifeSeq® 
human and PathoSeqO microbial genomic databases 

35 (Incyte Pharmaceuticals Inc., Palo Alto, CA, USA), and 
the GENESEQ patent sequence database (Derwent, London, 
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UK) . Three major results were obtained on the basis of 
the sequence similarity searches: function, novelty, 
and specificity. A putative function was decteced on 
the basis of the similarity with sequences with a 
5 known function, the novelty was based on the absence 
or presence of the sequences in public databases, and 
the specificity was based on the similarity with 
vertebrate homologues. 

10 Methods 

Blastx of the nucleic acid sequences against the 
appropriate protein databases: Swiss-Prot for clones 
of which the complete sequence is present in the 
public domain, and paorfp (PathoSeq™) for clones of 
15 which the complete r.^quences is not present in the 
public domain. 

The protein to which the translated nucleic acid 
sequence corresponds to is used as a starting point. 
The differences between this protein and our 
20 translated nucleic acid sequences are marked with a 

double line and annotated above the protein sequence. 
The following symbols are used: 

a one-letter- amino acid code or the ambiguity 
code X is used if our translated nucleic acid sequence 
25 has another amino acid on a certain position, 

the stop codon sign * is used if our translated 
nucleic acid sequence has a stop codon on a certain 
position. 

The letters fs (frame shift) are used if a frame 
30 shift occurs in our translated nucleic acid sequence, 
and another reading frame is used, 

the words ambiguity or ambiguities are used if a 
part of our translated nucleic acid sequence is 
present in the proteins, but not visible in the 
35 alignments of the blast results. 

The phrase "missing sequence" is used if the 
translated nucleic acid sequence does not comprise 
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that part of the protein. 

Blastx: compares the 'six-frame conceptual 
translation products of a nucleotide query -sequence 
(both strands) against a protein sequence database. 

5 

Gene Knock-outs 

To verify that the growth effect was due to the 
interference with the identified gene and to support 
the specificity of the antisense effect, single allele 
10 knock-outs were made in the identified genes (Figures 
28 to 31) using the URA-blaster method (Fonzi and 
Irwin 1993) . 

Screening for compounds modulating expression of 
15 polypeptides critical for growth and survival of C. 

The method proposed is based on observations 
(Sandbaken et ai., 1990; Hinnebusch and Liebman 1991; 
Ribogene PCT WO 95/11969, 1995) suggesting that 

20 underexpression or overexpression of any component of 
a process (e.g. translation) could lead to altered 
sensitivity to an inhibitor of a relevant step in that 
process. Such ah inhibitor should be more potent 
against a cell limited by a deficiency in the 

25 macromolecule catalyzing that step and/or less potent 
against a cell containing an excess of that 
macromolecule, as compared to the wild type (WT) cell. 

Mutant yeast strains, for example, have shown 
that some steps of translation are sensitive to the 

30 stoichiometry of macromolecules involved. (Sandbaken 
et ai. 1996) . Such strains are more sensitive to 
compounds which specifically perturb translation (by 
acting on a component that participates in 
translation) but are equally sensitive to compounds 

35 with other mechanisms of action. 

This method thus not only provides a means to 
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identify whether a test compound perturbs a certain 
process but also an indication of the site at which it 
exerts its effect. The component which is 'present in 
altered form or amount in a cell whose growth is 
affected by a test compound is potentially the site of 
action of the test compound. 

The assay to be set up involves measurement of 
growth of an isogenic strain which has been modified 
only in a certain specific allele, relative to a wild 
type (WT) C. albicans strain, in the presence of R- 
compounds. Strains can be ones in which the 
expression of a specific essential protein is impaired 
upon induction of anti-sense or strains which carry 
disruptions in an essential gene. An in sllico 
approacu to finding novel essential genes in C. 
aijbicans will be performed. A number of essential 
genes identified in this way will be disrupted (in one 
allele) and the resulting strains can be used for 
comparative growth screening. 

Assay for High Throughput screening for dxugs 

35 pi minimal medium (S medium + 2% galactose + 
2% maltose) is transferred in a transparent flat- 
bottomed 96 well plate using an automated pipetting 
system (Multidrop, Labsystems) . A 96-channel pipettor 
(Hydra, Robbins Scientific) transfers 2.5 pi of R- 
compound at 10"^ M in DMSO from a stock plate into the 
assay plate. 

The selected C. albicans strains (mutant and 
parent (CAI-4) strain) are stored as glycerol stocks 
(15%) at -lO^'C. The strains are streaked out on 
selective plates (SD medium) and incubated for two 
days at 30*^0. For the parent strain, CAI-4, the medium 
is always supplemented with 20 pg/ml uridine. A single 
colony is scooped up and resuspended in 1 ml minimal 
medium (S medium + 2% galactose + 2% maltose) . Cells 
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are incubated at 30°C for 8 hours while shaking at 
250 rpm. A 10 ml culture is inoculated at 250.000 
cells/ml. Cultures are incubated at 30**C far 24 
hours while shaking at 250 rpm. Cells are counted in 
5 Coulter counter and the final culture (S medium + 2% 
galactose + 2% maltose) is inoculated at 20.000 to 
50.000 cells/ml. Cultures are grown at 30**C while 
shaking at 250 rpm until a final OD of 0.24 (+/- 0.04) 
600nM is reached. 

10 200 pi of this yeast suspension is added to all 

wells of MW96 plates containing R-compounds in a 450 
(or 250) ul total volume. MW96 plates are incubated 
(static) at 30''C for 48 hours. 

Optical densities are measured after 48 hours. 

15 Test growth is expressed as a percentage of 

positive control growth for both mutant (x) and wild 
type (y) strains. The ratio (x/y) of these derived 
variables is calculated. 
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Table 1 



Seq ID No. 




Clone Function 
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X 


382c_cp 




2 


c. 


392c_cp 


TUFl 


3 






RAD53 


4 


4 


417c_cpG2L 


— 


5 




323c_af 


— 


6 


o 






7 


7 






8 


8 


4 09c_cp 




9 


9 


382c_cpGlL2 




'i 0 


14 


382c_cp (prt) 




11 


15 


392c_cp (prt) 


TUFl 


12 


16 




RAD53 


13 


17 


325c__af (prt) 2 




14 


18 


322c_cp (prt) 2 




15 


19 


26g3 (prt) 




16 


20 


417c_cp 92L (prt) 





322c-cp is a member of the UPF0057 protein 
family. It contains potential transmembrane 
regions (6-23aa; 30-53aa) and could be low 
temperature or salt-stress inducible. 



325c-af shows similarity to IMP4 yeast and 
related proteins and it might be involved in rRNA 
processing in Candida albicans in a similar way 
to IMP4. 
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Claims 



10 



1. A nucleic acid molecule encoding a_ 
polypeptide which is critical for survival and growth 
of the yeast Candida albicans and which nucleic acid 
molecule comprises any of the sequences of nucleotides 
illustrated in Seq ID Nos 1 to 9. 

2. A nucleic acid molecule encoding a 
polypeptide which is critical for survival and growth 
of the yeast Candida albicans and which nucleic acid 
molecule comprises any of the sequences of nucleotides 
illustrated in Seq ID Nos 1 to 3. 

3. A nucleic acid molecule encoding a 
polypeptide which is critical for survival and growth 
of the yeast Candida albicans and which nucleic acid 
molecule comprises any of the sequences of nucleotides 
illustrated in Seq ID Nos 1 or 2 and fragments or 
derivatives of said nucleic acid molecules. 



20 



4. A nucleic acid molecule according to any of 
claims 1 to 3 which is mRNA. 

^- ^ nucleic acid molecule according to any of 
claims L to 3 which is DNA. 

6. A nucleic acid molecule according to claim 5 
which is cDNA. 



30 



35 



7, A nucleic acid molecule capable of 
hybridising to the molecules according to any of 
claims 1 to 6 or the sequences illustrated in any of 
Seq ID Nos 1 to 9 under high stringency conditions. 

8. An antisense molecule comprising a nucleic 
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acid molecule capable of hybridising to the molecules 
according to any of claims 1 to 6 or the sequences 
illustrated in any of Seq ID Nos 1 to 9. 

5 9. Cells containing a nucleic acid molecule 

according to any of claims 1 to 8, wherein said cells 
are bacterial or eukaryotic. 

10. A polypeptide encoded by the nucleic acid 
10 molecule according to any of claims 1 to,. 7 or the 

sequences illustrated in any of Seq ID Nos 1 to 9. 

11. A polypetide having any of amino acid 
sequences illustrated in any of Seq ID Nos 14 to 20. 

15 

^2. A recombinant DNA construct comprising a 
nucleic acid molecule according to claim 5 or 6. 

13. A recombinant DNA construct comprising a 
20 nucleic acid molecule according to claim 5 or 6 

wherein said nucleic acid molecule is inserted in the 
antisense orientation. 

14. A recombinant DNA construct according to 

25 claim 12 or 13 wherein said recombinant DNA construct 
is an expression vector. 

15. A construct according to claim 14 which 
comprises an inducible promoter. 

30 

16. A construct according to claim 14 or 15 
which comprises a sequence encoding a reporter 
molecule. 



35 



17. Cells containing a recombinant DNA construct 
according to any of claims 12 to 16, wherein said 
cells are bacterial or eukaryotic. 



wo 00/34481 



PCT/EP99/09833 



- 39 - 

18. A nucleic acid molecule according to any of 
claims 1 to 8 or the nucleotide sequences illustrated 
in Seq ID Nos 1 to 9 for use as a medicameqj,. 

5 19. Use of a nucleic acid molecule according to 

any of claims 1 to 8 or the sequences illustrated in 
Seq ID Nos 1 to 9 in the preparation of a medicament 
for treating Candida albicans associated diseases. 

20. A polypeptide according to claim 10 or 11 
for use as a medicament. 

21. Use of a polypeptide according to claim 10 
or 11 in the preparation of a medicament for treating 

15 Candida albicans associated infections. 

22. A pharmaceutical composition comprising a 
nucleic acid molecule according to any of claims 1 to 
8 or a polypeptide according to claim 10 or 11 
together with a pharmaceutically acceptable carrier 
diluent or excipient therefor. 



20 



23. A Candida albicans cell comprising an 
induced mutation in the DNA sequence encoding the 

25 polypeptide according to claim 10. 

24. A method of identifying compounds which 
selectively modulate expression or functionality of 
polypeptides or metabolic pathways in which these 
polypeptides are involved and which are crucial for 
growth and survival of Candida albicans, which method 
comprises : 

(a) contacting a compound to be tested with one 
or more Candida albicans cells having a 
^5 mutation in a nucleic acid molecule 

according to any of claims 1 to 8 which 



30 
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mutation results in overexpression or 
underexpression of said polypeptides in 
addition to contacting one or mo5^_ wild type 
Candida albicans cells with said compound, 
5 (b) monitoring the growth and/or activity of 

said mutated cell compared to said wild 
type; wherein differential growth or 
activity of said one or more mutated Candida 
cells is indicative of selective action of 
10 said compound on a polypeptide or another 

polypeptide in the same or a parallel 
pathway. 

25. A compound identifiable according to the 
15 method of claim 24. 

26. A compound according to claim 25 for use as 
a medicament. 



20 



27. Use of a compound according to claim 25 in 
the preparation of a medicament for treating Candida 
albicans associated diseases. 

28. A pharmaceutical composition comprising a 
25 compound according to claim 25 together with a 

pharmaceutically acceptable carrier, diluent or 
excipient therefor. 




A method of identifying DNA sequences from a 
30 c^elijj^^Ji^ organism which DNA encodes polypeptides which 
are critical for growth or survival of said cell or 
organism, which method comprises: 

(a) preparing a cDNA or genomic library from 
said cell or organism in a suitable 
35 expression vector which vector is such that 

it can either integrate into the genome in 
said cell or that it permits transcription 
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of antisense RNA from the nucleotide 
sequences in said cDNA or genomic library, 
(b) selecting transf ormants exhibiting impaired 
growth and determining the nucleotide 
sequence of the cDNA or genomic sequence 
from the library included in the vector from 
said transf ormant. 

30. A method according to claim 29 wherein said 
cell or organism is a yeast or filamentous fungus. 



31. A method according to claim 29 or 30 wherein 
said cell or organism is any of Saccharomyces 
cerevisiae, Schizosaccharomyces pombe or Candida 

15 albicans . 

32. An antibody capable of binding to a 
polypeptide according to claim 10 or 11. 

20 33. An oligonucleotide comprising a fragment of 

from 10 to 120 contiguous nucleotides of a nucleic 
acid molecule according to any of claims 1 to 8. 

34. An oligonucleotide according to claim 33 
25 comprising a fragment of from 10 to 50 contiguous 
nucleotides . 
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AACAGCTGGT CTTCTGCTAA TACATTCAAC CCTTTCCA^TA TCTATACTCC 
1 50 

K^^QKklKTQ^K TAACTGATGA ACAATTGAAT ACCATTGCAT TGACATTTGG 
51 100 

TTTTGCTTCA ATAATATTAA TCATAATATA TCATGCCATA TCTACTAATG 
101 150 

TACATAAATT AGAAGATGAA ACCCCATCAT CTTCATTTAC CAGAACAAAT 
151 200 

ACTACTGAAA CTACTGTTGC AAGTAAGAAA AAGAAGTAAT AACTGATGGA 
201 250 

TTTTTCTTCC TACCACCAAT TGAATAATGC TAGACTTGTT GGTGTGCTAC 
251 300 

AAATATTTCA AAAGAAAATA CGAATACTTT ATAAAATGGT AAGAACGGAA 
301 350 

GATGGTTTCT CATTTATACA CTAAATACAA ATCACATACA CATACACAAA 
351 400 

CACAAATACA TACATACACC TATATCCCTT TATTTGAT 
401 438 



SUBSTITUTE SHEET (RULE 2$) 
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ATGTTAAAAA CACTAACTCA AACTTTACGC TTAACTGGGA AAGCTTTCCC 

1 50 

AAAGGTCCGT CCGGCCTTGA TCAGAACCTA CGCTGCCTTC GiCCGTTCTA 

51 100 

AACCTCATGT CAACATTGGT ACTATTGGTC ATGTTGATCA TGGTAAAACT 

101 150 

ACATTGACTG CTGCTATCAC CAAAGTTTTA GCCGAACAAG GTGGTGCCAA 

151 200 

CTTCTTGGAT TATGGTTCTA TTGATAGAGC TCCAGAAGAA AGAGCTAGAG 

201 250 

GTATCACTAT TTCCACTGCC CACGTTGAAT ACGAAACCAA GAACAGACAC 

251 300 

TATGCCCACG TTGATTGTCC AGGACACGCT GATTATATCA AAAATATGAT 

301 350 

TACTGGTGCC GCICAAATGG ATGGTGCTAT CATTGTTGTT GCTGCCACTG 

351 400 

ATGGTCAAAT GCCTCAAACC AGAGAACATT TGTTATTGGC CAGACAAGTT 

401 450 

GGTGTTCAAG ACTTGGTTGT GTTTGTCAAC AAAGTCGATA CTATTGATGA 

451 500 

CCCTGAAATG TTGGAATTAG TCGAAATGGA AATGAGAGAA TTGTTATCCA 

501 550 

CCTACGGTTT TGATGGTGAC AACACTCCAG TTATTATGGG ATCTGCTTTA 

551 600 

ATGGCTTTGG AAGACAAGAA ACCAGAAATT GGTAAGGAAG CTATCTTGAA 

601 650 

ATTGTTAGAT GCTGTCGATG AACACATTCC AACTCCATCA AGAGACTTGG 

651 700 

AACAACCATT TTTGTTACCA GTTGAAGACG TGTTCTCCAT CTCCGGTAGA 

701 750 



GGAACTGTTG TCACTGGTAG AGTTGAAAGA GGTGTTTTGA AGAAGGGTGA 
751 800 

AGAAATCGAA ATTGTTGGTG GTTTTGACAA ACCTTACAAG ACTACTGTTA 
801 850 

CCGGTATTGA AATGTTCAAA AAAGAATTAG ACTCTGCTAT GGCTGGTGAC 
351 900 
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AACTGTGGTG TTTTGTTAAG AGGTGTTAAA AGAGATGAAA TCAAGAGAGG 
901 950 

TATGGTTTTG GCCAAACCAG GTACTGCTAC TTCTCACAAG AAGTTCTTGG 
951 1000 

CTTCCTTGTA TATTTTGACT TCCGAAGAAG GTGGTCGTTC CACTCCATTT 
1001 105^' 

GGTGAAGGTT ACAAGCCTCA ATGCTTCTTC AGAACTAACG ATGTCACTAC 
1051 1100 

CACATTTTCA TTCCCAGAAG GAGAAGGTGT TGATCATTCT CAAATGATCA 
1101 1150 

TGCCAGGTGA CAACATTGAA ATGGTTGGTG AATTGATCAA ATCTTGTCCA 
1151 1200 

TTAGAAGTCA ACCAACGTTT CAACTTGAGA GAAGGTGGTA AAACTGTTGG 
1201 1250 

TACTGGTTTG ATTACCAGAA TCATCGAATA AACAGAATGT GCACTGTGAA 
1251 1300 

TAATAAAAAG AAAAGAGGTA TATATAGGTG ACTTTGTATT TTGTATTGAA 
1301 1350 

CAATAAAATT CTGTAAATAG TAAGGGCCTC 
1351 1380 
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GAATTCGCCCTTAAGCACTCGTTTCAACTATACATTCAGTAACAACACCCTTAATTTACCAAACT^ 
TTAATGGAAGTA 

ACACAACGGACGCAGAGTCAGACACAACCAACACAACAGTCACCGACAACTCAGACGCAAACCCAAAG 
CAAAGAGGACCA 

GAATAGGATTTGTCAATTGATTTGCTCCACGGGTCAGTTTGGCAATTATGATTTGAATATCAACGATA 
AAACTATCGTAC 

AAGGTAAAATGACGTGGTATTTTGGAAGAGACCCCAACTCAGATTTGCAAGTGGCGTCGTCGTCGAGA 
ATTTCAAACAAG 

CATTTTCAAATCTGGCTCAACTTCAATGATAAATCACTATGGATAAAGGACACTTCAACTAACGGGAC 
ACACCTTAACAA 

CAGTCGATTGGTGAAAGGATCAAACTACCTTCTTAATCAGGGTGATGAAATAGCAGTAGGGGTTGGT^ 
GAGACGAGGACG 

TTGTGAGGTTTGTCGTTGTCTTTGGTGACAAATACAACCCGGCAAAGCTACCTGATTCGACCAACACA 
ATTAAAGATGAA 

GGAATATACAAAGACTTTATTGTGAAAAATGAAACGATAGGCCAAGGAGCATTTGCCACTGTGAAAAA 
GGCGATTGAACG 

ATCTACGGGCGAGTCGTACGCGGTGAAGATTATAAATCGAAGAAAAGCATTAAATACCGGTGGTGGAA 
GTGCCATGGCAa 

GAGTGGACCG : '.iAATTGTCCATATTAGAGCGGCTCAACCACCCAAATATAGTTGCTCTAAAAGCTTTT 
TATGAAGATATG 

GACAATTACTATATTGTGATGGAATTGGTGCCGGGCGGTGATTTGATGGACTTTGTGGCTGCAAACGG 
TGCAATAGGAGA 

AGACGCAACACAAGTGATCACGAAACAGATTCTAGAAGGAATTGCCTATGTTCATAATTTAGGAATCT 
CCCATCGTGATT 

TGAAGCCAGATAATATTTTGATTATGCAAGATGACCCAATACTTGTTAAAATCACCGACTTTGGATTG 
GCAAAATTCAGT 

GACAATCTGACGTTTATGAAAACTTTTTGTGGTACATTGGCGTATGTTGCTCCCGAAGTTATCACCGG 
TAAGTATGGATC 

ATCGCAGATGGAACTGCAACAAAAGGACAACTACTCTTCCTTGGTTGACATTTGGTCTTTGGGATGTT 
TGGTTTATGTAC 

TTTTAACTTCTCATTTACCATTCAAtrGGGAAAAACCAGCAACAAATGTTTGCCAAG 
GAATTTCATGAG 

GCTCCATTAAATTCATACGACATTTCTGAAGACGGAAGAGATTTCTTGCAGTGCTGCCTACAGGTTAA 
TCCTAAACTAAG 

GATGACGGCTGCTGAAGCTTTGAAACATAAATGGTTGCAAGACITGTATGAAGAGGATTCTGTCAAAT 
CATTGAGTTTAT 

CGCAATCACAGTCGCAACAATCTCGAAAGATAGATAATGGTATCCATATCGAATCATTGAGCAAAATT 
GATGAAGACGTT 

ATGCTTCGTCCATTGGATAGCGAAAGAAATAGGAAATCAAGTAAACAGCAAGATTTCAACSGTACCCAA 
GCGTGTGATTCC 

GTTATCTCAACATCCTGCAACACCGTTACCAATGTCACAACCGAAAAAGAGGCCGTATCAAATAGAC^ 
CTAGAACAAACA 

AAAAAGTCGATTTGGAAGAACCTCTGACAAGCAAGAAAGTCAAGCTAAGTGATTCCGTTGTTGCGGAA 
GACTACTTGAAG 

TTGGGGCCACTTGCAAATTCGTTATTCCAAGAAACAATAAATATTTCAAAGTCCCCGTTTTCTT^ 
AAGAAATGACAC 

TTGTGATTGOSAGATAGACGACGACAGACTATCCAAACTTCATTGTGTCATTACCAAAGAAAACGACT 
CTATATGGTTAT 

TGGATAAGAGTACTAACTCGTGCTTGGTCAACAATACTAGTGTTGGAAAAGGOU^CAAAGTTT^ 
AGAGGAGGGGAG 

ATATTACATCrCITCTTTGACCCATTGTCACrGCAACATATAGGTTTC::^^ 

ACTGTCTGGTGA 

ACATAAGAGTCAAGTGGAGGTTTTGAAACAAACCTCAGAAGAAATGAATATTATTCCACTTA 
GTTTAAGTAGTA 

TAAGTTCATAGATTTAGCATATATACyU^GCATTTCCTATAGAAACAAAGGTTCATTAAT^ 
TACCTCCATGCA 

ATTACATTTACTTCTTCTTCCAAGGGCGAATTCTGCAGATATC 
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ATGGGTAGTATGTGAAGATACAATATTGAAAGTGTTTACTAGAATATCTAAGATGTTTGAGCCCATGG 
AC 

ATTTTTGGATTTGATAATTAAAAAAAGTAGCAATAGATTATTGCGTTGGAGAAAGAATCACCATAGTT 
GC 

AAGATTTGATAGATGTTAAAATGTTCACGCAGGCGAAAGATGTAACATCTCTTAAAGTAAGAAGAATA 
TG 

GACATGAATAAAAATAGATAGCACTATTTTGGAACTTGTTGAAGATATTAAA/.XAGAATGGGATTTCA 
AC 

ATAGATATTCAAAGTAACGAAACCrCACAATCAAATAAAAACAACAGTAATACTAACAATTCAATTTT 
TA 

TTTTTATAGAGGGTACrCCATCTTTAGGTAAACGTCACAACAAATCTCACACCTTATGTAACAGATGT 
GG 

CCGTCGTTCATTCCACGTCCAAAAGAAGACCTGTTCTTCTTGTGGTTACCCAGCTGCTAAAATGAGAT 
CT 

CACAACTGGGCCTTAAAAGCCAAAAGAAGAAGAACTACTGGTACCGGTAGAATGGCTTACTTGAAACA 
CG 

TTACCAGAAGATTCAAGAACGGTTTCCAAACTGGTGTTGCTAAAGCTCAAACCCCTTCCGCTTAAACT 
AA 

TTACTGAAGTTATTGGTCATGCAtTAGTCATTATTCATTAAAGTCATGTTAAGCATAGCAAAGGAAGA 
AT 

TGGTTAGATTCTTGTTTAAAATGTAATGACTATTTAATATCTGTTTAAATAAGAGGTTTAGTCTTTAT 
TT 

TTTTACGTATACACCAAAAAAAAAAGAAACAAATAAAATCTGTATATTAATGTTGG 
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ATGGGTACTA GTACAAGTGA AGCATTGAAG MK!ZKlQM^Kh ACAAACAGCG 
1 50 

AAGACAGAAA GTTTTTGCAG AAATAAAACA TGAAAAGAAT AAA^^ftACGTC 
51 100 

ATAAGCAAAG AGCCGAAAGA GCTAAGGAAG AAAGAGAAAA CCCCGAATTA 
101 150 

AGAGAGGAAA GAATAGCAGC TAATATCCCA GATACTATAG ATAGCAAACG 
151 200 

TATTTATGAT GAGACTATAG CTGCTGAAGT TGAAGGAGAT GACGAGTTTC 
201 250 

AGTCATATTT CACTAACTTG TTGGAAGAAC CAAAGATTTT GTTGACAACA 
251 300 

AGTGCCAATG CTAAAAAACC GGCCTATGAA TTTGCAGACA TGATCATGGA 
301 350 

CTTTTTACCG AATGTGACAT TTATCAAAAG GAAGAAGGAA TATACAATGC 
351 400 

AAGATATGGC CAAATATTGC TCGAATAGAG ACTTCACTGC ATTGCTTGTC 
401 450 

ATCAACGAAG ACAAGAAGAA GGTCAATGGT ATAACGCTCA TCAATTTACC 
451 500 

TGAAGGGCCA ACATTTTATT TTTCGATTAC ATCAATAGTT GATGGGAAAA 

501 ^ ^ 550 

GAATTAAGGG ACACGGGAAA GCTGGTGATT ATTTACCTGA GATTGTATTG 
551 600 

AATAATTTCA ATTCAAGATT GGGTAAAACT GTGGGAAGAC TATTTCAAAG 
601 650 

TATTTTCCCT CATAAACCTG AACTTCAAGG AAGACAAGTG ATTACTTTGC 
651 700 

ACAATCAACG TGATTATATT TTTTTCAGAA GACATAGATA TATTTTCAGA 
701 750 

AATGAGGAAA AGGTTGGATT GCAGGAATTG GGTCCGCAGT TTACATTAAA 
751 800 

GCTAAGAAGA ATGCAAAAGG GAGTACGTGG TGATGTTGTT TGGGAACACA 
801 850 

GACCAGATAT GGAAAGAGAT AAGAAGAAGT TTTATTTATA AGCGGGTGTA 
851 900 
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7/^3 



TAAAGGTAGT AGTAGTGCGT TTATAAGTAT GTGTGTGTGT TTATGCATAG 
901 950 

ATGTGTAAAG AGTAATACAG CTAATTCG 
951 978 
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AACTAATTTG TTTAAACATC AATACCAAGA AGTTTTTACA ATTCAATCCC 
1 50 

ACATACACCA TTAATTATGA ATTCTGAAAA GATTATTGAA GTTATCATTG 
51 100 

CTATTTTCTT ACCACCAGTA GCTGTGTTTA TGAAATGTGG TGCCACTACC 
101 150 

CCATTATGGA TTAACTTGGT ATTATGTATC TTTATTTGGT TCCCTGCTAT 
151 200 

CTTACATGCC TTATACGTTG TGTTGAAAGA TTAAACAAAC ACCAGAGATT 
201 250 

TACTGCTTGA TGAATTGATT ACTCCAAAGA GTTGTGACTA GTTCCCAGTG 
251 300 

TGTTTTTTTT GCCTTCCAAC TTTCTTTTAC ATTTTTCCAT TACTACCACT 
301 350 

GTCTTCCCCC CTATTTTGGA GAGTTTTCAA AATTTATCCA AAACATGTTA 
351 400 

GTCATTAAAC CATATTATTA TAATTATTCT TTTTTGTATT TTTTTCCCTT 
401 450 

AAAACACGTT AATTTATTAA TCGTTTCGTT GTTTGGTATT TTATTTTTTT 
451 500 

GTATTTATCA ATTGGAATAT ATATCTATAC ATGAATTTAT TATCCATTGT 
501 550 

ACCAATTGTT AAAACATTTT GTTAGTTTTT TGTTACTAGT ATAAAANNAT 
551 600 

AATAAAAGTT TANTTCAAC 
601 619 
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ATGACATTAG GGTTCGATAA ATTCATAAGC AAGGTCAGCA CTCATAGACG 
1 50 

TCAATCTGAA CCATCAATCT TGGAAATCGC AGCCACCAAT TCTCAAAATA 
51 100 

AATCGAGAAG GCTAAGTATG GATAATGGTC ATTGTTATGT TCGTGAATCA 
101 150 

ACTAATAATC ATCATCATTT AAATACCGTC GTTGATAATT TACGACAGCG 
151 200 

TGCGGGATCG TTTTCATTTA TTTCACATCA CCATAATCAC CATCAGAATA 
201 250 

GTCACGATAA TTATACTGTC GATCCCCTTA CATCAAACGG AGCACGAATT 
251 300 

TCCCGATCAC GTTCACGTTC CAAATCAGTT GGGCACGGAG AAGCAATATC 
301 350 

ACCAGCGTAT TTTTCCAAGA AT/;'JVACCAA AGATTTAGTG AAACAGGAAA 
351 400 

CAGCACATAT CATTCTGAAG AAATTACTCA ACATGTTACA AGATTTGGAT 
401 450 

TTACAAAACC CTATTGCATT GAAAACAATA TCACAAGGTT CAGAATCAAA 
451 500 

GTTTTGTAAA ATCTACGTGT CTAACACTAA TAATTGTATT TACTTACCAG 
501 550 

CAGCAAGTTC AACAAGTTTC ACTTATGAAG ATGATGAAAA TGGCGGCGTT 
551 600 

ATAATTGCTG AAGATAGAAA TGATGAAATG CCAACAGCAG TTAATAACAA 
601 650 

TACTTTGTCA ATGGATAGTA TAAATCATTC AGAGACTGAT TTCCTGGATT 
651 700 

CTCCACCACC TCCAGATTTA TTTTCTAAAA TGAAATCATT CCATTCACCA 
701 750 

AATTACTTGA CTTCAAAAAT CGATTCTGAA TGTCCAATTC CACATACATT 
751 800 

TGCTGTGATT GTTGAATTAA CCAAGGACTC TTTGATTATT AAAGATCTTC 
801 850 

ATTTCCAATT TCAGTCATTA ACTACCATTT TATGGCCAAC TGGGGATGCA 
851 900 
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TATAATCGGA CTCATGCCAA GGAGAAATTT ACCATTGGGA ATATGGAATG 
901 950 

GCGTACATCT TTAAGCGACG CCGACTATTA TATCAATAGT TCTAATTCCA 
951 1000 

ACGATGTTAA GCTGAAAAAC TTGGGTCCTG AAGATCTTAT TAATCGAACT 
1001 1050 

AGAGAATACA AATTAATCGA TATTGAAGAA CCAAACAATT- CATCAAACAG 

1051 T 1 on 

TTTACTGGAT GATGACATGG KTKlThKlhK TATTACGTCG CCATTATCAA 
1101 1150 

CGTCACCAAC ATCAAGTTCA ACTTCAACAA ATTCAACCTC CAACTCATTG 
1151 1200 

GGTTCAGATT CATATAAAGC TGGTCTTTAT GTATTTTTAT TACCAATCTT 
1201 1250 

ATTGCCAGAA CATATTCCTG CTTCCATTGT TTCTATTAAT GGTTCATTGG 
1251 1300 

CTCATACATT ACTGGTTGAA TGCAATAAAT ATACTGATAA GTTGAATCGG 
1301 1350 

AAATCAAAAG TATCAGCATC GTACAATTTA CCTATGGTCC GTACTCCACC 
1351 ^ 1400 

AAACATTGGT AATTCCATTG CTGATAAGCC AATTTATGTT AATAGGATTT 
1401 1450 

GGAATGATGC CGTACATTAT ATTATAACTT TCCCCCGCAA ATATGTTACT 
1451 1500 

TTGGGTTGTG AACACATGAT AAATGTGAAA TTACTGCCCA TGGTGAAAGA 
1501 1550 

TGTGGTTATC AAGCGTATTA AATTTAATGT ATTGGAGAGA ATAACTTATG 
1551 1600 

TTTCCAAAAA TTTATCACGA GAATATGATT ATGATAGTGA AGACCCCTAT 
1^01 1650 

TGTATTCATC CAGTTTCTAA AGAAAATAAA GTACGTGAAC GTGTTGTGTC 
1651 1700 

GTTATATGAA TTGAAAACGA AGGCAAAACA ATCTTCTGGT GGACATCTTG 
1701 1750 

AAGCTTATAA ACAAGAAGTT ATGAAATGTC CGGAAAATAA CCTTTTATTT 
1751 1800 
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TCTTGTTATG AGGTTGAAAA TGATAATAAT AACGGCAACG GCAACGGCAA 
1801 1850 



1851 1900 



TTGCTACACC TTTAGATATC AATGTTTCTT TACCATTTTT AACTACTATG 

1901 1950 

TCTGATAGTT TAAHTATGAC ATCAGCCATA GAAGAAGAAG GTTCAGATCT 

1951 2000 

GCCTCATACA TCAAGAAGAG GGTCGGCAGT GAGTATGACT GATAATAATA 

2001 2050 

CTACCCCAAG TAACAATAAC CCTTTATCTC CATTTTTGGG AGCAGTGGAA 

2051 2100 

ACTAATGGTG CTAGTATAAA TGAAATTGGT GATCATACAT TATTCCCTGA 

2101 2150 

TTCTAATTTT CGACATA'TTG AAATTAAACA TCGATTACAA GTTACATTTA 

2151 2200 

GGATTTCTAA ACCGGATCTG GATAATAT^ TGCATCATTA TGAAGTGGTT 

2201 2250 

ATTGATACCC CCATCGTTTT ACTTAGTTCA AAATGTCAAG AAGATTCTCC 

2251 2300 

TCCTCCTTAT AGTTCTGTA 
2301 2319 
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AACGTTCGTG CAAAAGGCTA TACTGGTGAT ATCCACGCAG ATGAAGAGCA 
1 50 

AGTTTAATCA ACTCTTTGTC AATTAATGCT GTACTTGTTT TCATTTTATT 
51 100 

TGCTGGCATT TAAAGAATAC CCATAGTTCA GAAAATAAAA TTGAAAAATT 
101 150 

TAAAAAAAAA CGCAATATCA TTCATTTTTT TTGTTTTTTT GACAATAATA 
151 200 

TTAATATGTA GTTACCAATG TTTTTAGATT TTATATGTTT TGAAAAAATA , 
201 250 

GTTTG 
251 



AACCTTACAA TCATTATACC AACTATCAAA ATCATAAGAC TCTTNAACTT 
1 50 

CTGTTTTTGA TAGTTGGTAT AATGATTTAT GTATTATCTT AATTCATTAT 
51 100 

TATTAGTTTC GGTCACAAA 
101 119 
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CYCT 
5amHI(6901) 
^val(6842) 
Pst I (6635) 
hlFNIS 

^vfll(6067) 
5awH I (6035) 
.4vfll(6029) 

£coR I (5704) 
M/2d III (5403) 
Pgail 
////id .'I (5363) 

ApaU (4999) 

i:coR I (4957) 



CaURA3 

£coR I (3827) 



ORI 




^paLI (704) 



AMP 



^Ll(1950; 



ApaU (2447) 
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1 


AGCTTGAGTA 


TTCTATAGTG 


TCACCTAAAT 


AGCTTGGCGT 


AATCATGGTC 


51 


ATAGCTGTTT 


CCTGTGTGAA 


ATTGTTATCC 


GCTCACAATT 


CCACACAACA 


101 


TACGAGCCGG 


AAGCATAAAG 


TGTAAAGCCT 


GGGGTGCCTA 


A7GAGTGAGC 


151 


TAACTCACAT 


TAATTGCGTT 


GCGCTCACTG 


CCCGCTTTCC 


AGTCGGGAAA 


201 


CCTGTCGTGC 


CAGCTGCATT 


AATGAATCGG 


CCAACGCGCG 


GGGAGAGGCG 


251 


GTTTGCGTAT 


TGGGCGCTCT 


TCCGCTTCCT 


CGCTCACTGA 


CTCGCTGCGC 


301 


TCGGTCGTTC 


GGCTGCGGCG 


AGCGGTATCA 


GCTCACTCAA 


AGGCGGTAAT 


351 


ACGGTTATCC 


ACAGAATCAG 


GGGATAACGC 


AGGAAAGAAC 


ATGTGAGCAA 


401 


AAGGCCAGCA 


AAAGGCCAGG 


AACCGTAAAA 


AGGCCGCGTT 


GCTGGCGTTT 


451 


TTCCATAGGC 


TCCGCCCCCC 


TGACGAGCAT 


CACAAAAATC 


GACGCTCAAG 


501 


TCAGAGGTGG 


CGAAACCCGA 


CAGGACTATA 


AAGATACCAG 


G CGTTTCCCC 


551 


CTGGAAGCTC 


CCTCGTGCGC 


TCTCCTGTTC 


CGACCCTGCC 


GCTTACCGGA 


€01 


TACCTGTCCG 


CCTTTCTCCC 


TTCGGGAAGC 


GTGGCGCT'^ r 


CTCATAGCTC 


651 


ACGCTGTAGG 


TATCTCAGTT 


CGGTGTAGGT 


CGTTCGCTCC 


AAG CTGGG CT 


701 


GTGTGCACGA 


accccccgtt 


CAGCCCGACC 


GCTGCGCCTT 


ATCCGGTAAC 


751 


TATCGTCTTG 


AGTCCAACCC 


GGTAAGACAC 


GACTTATCGC 


CACTGGCAGC 


801 


AGCCACTGGT 


AACAGGATTA 


GCAGAGCGAG 


GTATGTAGGC 


GGTGCTACAG 


851 


AGTTCTTGAA 


GTGGTGGCCT 


AACTACGGCT 


ACACTAGAAG 


GACAGTATTT 


901 


GGTATCTGCG 


GTCTG CTG AA 


GCCAGTTACC 


TTCGGAAAAA 


GAGTTGGTAG 


951 


CTCTTGATCC 


GGGAAACAAA 


CCACCGCTGG 


TAGCGGTGGT 


TTTTTTGTTT 


1001 


GCAAGCAGCA 


GATTACGCGC 


AGAAAAAAAG 


GATCTCAAGA 


AGATCCTTTG 


1051 


ATCTTTTCTA 


CGGGGTCTGA 


CGCTCAGTGG 


AACGAAAACT 


CACGTTAAGG 


1101 


GATTTTGGTC 


ATGAGATTAT 


CAAAAAGGAT 


CTTCACCTAG 


ATCCTTTTAA 


1151 


ATTAAAAATG 


AAGTTTTAAA 


TCAATCTAAA 


GTATATATGA 


GTAAACTTGG 


1201 


TCTGACAGTT 


ACCAATGCTT 


AATCAGTGAG 


GCACCTATCT 


CAGCGATCTG 


1251 


TCTATTTCGT 


TCATCCATAG 


TTGCCTGACT 


CCCCGTCGTG 


TAGATAACTA 


1301 


CGATACGGGA 


GGGCTTACCA 


TCTGGCCCCA 


GTGCTGCAAT 


GATACCGCGA 
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1351 GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG 

1401 AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT 

1451 CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT 

1501 TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC 

1551 GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA 

1601 CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG 

1651 ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC 

1701 AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG 

1751 TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA 

1801 CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG 

1851 CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC 

1901 TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT 

1951 GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG 

2001 AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 

2051 GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT 

2101 TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA 

2151 AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG 

2201 ACGTCTAAGA AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT 

2251 ATCACGAGGC CCTTTCGTCT CGCGCGTTTC GGTGATGACG GTGAAAACCT 

2301 CTGACACATG CAGCTCCCGG AGACGGTCAC AGCTTGTCTG TAAGCGGATG 

2351 CCGGGAGCAG ACAAGCCCGT CAGGGCGCGT CAGCGGGTGT TGGCGGGTGT 

2401 CGGGGCTGGC TTAACTATGC GGCATCAGAG CAGATTGTAC TGAGAGTGCA 

2451 CCATATGCGG TGTGAAATAC CGCACAGATG CGTAAGGAGA AAATACCGCA 

2501 TCAGGCGAAA TTGTAAACGT TAATATTTTG TTAAAATTCG CGTTAAATAT 

2551 TTGTTAAATC AGCTCATTTT TTAACCAATA GGCCGAAATC GGCAAAATCC 

2601 CTTATAAATC AAAAGAATAG ACCGAGATAG GGTTGAGTGT TGTTCCAGTT 

2651 TGGAACAAGA GTCCACTATT AAAGAACGTG GACTCCAACG TCAAAGGGCG 
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2701 AAAAACCGTC TATCAGGGCG ATGGCCCACT ACGTGAACCA TCACCCAAAT 

2751 CAAGTTTTTT GCGGTCGAGG TGCCGTAAAG CTCTAAATCG GAACCCTAAA 

2801 GGGAGCCCCC GATTTAGAGC TTGACGGGGA AAGCCGGCGA ACGTGGCGAG 

2 851 AAAGGAAGGG AAGAAAGCGA AAGGAGCGGG CGCTAGGGCG CTGGCAAGTG 

2 901 TAGCGGTCAC GCTGCGCGTA ACCACCACAC CCGCCGCGCT TAATGCGCCG 

2951 CTACAGGGCG CGTCCATTCG CCATTCAGGC TGCGCAACTG TTGGGAAGGG 

3001 CGATCGGTGC GGGCCTCTTC GCTATTACGC CAGCTGGCGA AAGGGGGATG 

3051 TGCTGCAAGG CGATTAAGTT GGGTAACGCC AGGGTTTTCC CAGTCACGAC 

3101 GTTGTAAAAC GACGGCCAGT GAATTGrAAT ACGACTCACT ATAGGGCGAA 

3151 TTGGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG TGGCGCGGTA 

3201 TTATCCCGTG TTGACGCCGG GCAAGAGCAA CTCGGTCGCC GCATACACTA 

3251 TTCTCAGAAT GACTTGGTTG AGTACTAATA GGAATTGATT TGGATGGTAT 

3301 AAACGGAAAC AAAAAAAAGA GCTGGTACTA CTTTCTTTAA AATTATTTTA 

3351 TTATTTGATT TTATTTAATA GTATATATTA TATTTTGAAC GTAGATTATT 

3401 TTGTTGAAAG TTGCTGTAGT GCCATTGATT CGTAACACTA ATTCTGTATT 

3451 AGTCATTCCT CTTGTTTGAT AGTATCCAAA AAAACGGCTA TTTTTTTGCA 

3501 ATCTTATTTC CTGCATATTA TACAGATAAC ATAATGAAAG AAAAAATCTT 

3551 TTTTTTTGTT CTTCAATGAT GATTTCAACC ATTCTTTTAA ACATTGATCA 

3601 ATTCCTGAGC AACAACCCCA TACACACTGG TTTATATACC GCCCCTTTTA 

3651 CAGTTGAAGA AAGAAATAGA AATAGAAATA GCAAACAAAA GATATGACAG 

3701 TCAACACTAA GACCTATAGT GAGAGAGCAG AAACTCATGC CTCACCAGTA 

3751 GCACAGCGAT TATTTCGATT AATGGAACTG AAGAAAACCA ATTTATGTGC 

3801 ATCAATTGAC GTTGATACCA CTAAGGAATT CCTTGAATTA ATTGATAAAT 

3851 TAGGTCCTTA TGTATGCTTA ATCAAGACTC ATATTGATAT AATCAATGAT 

3901 TTTTCCTATG AATCCACTAT TGAACCATTA TTAGAACTTT CACGTAAACA 
3951 TCAATTTATG ATTTTTGAAG ATAGAAAATT TGCTGATATT GGTAATACCG 
4001 TAAAGAAACA ATATATTGGT GGAGTTTATA AAATTAGTAG TTGGGCAGAT 
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4 051 ATTACCAATG CTCATGGTGT CACTGGGAAT GGAGTGGTTG AAGGATTAAA 

4101 ACAGGGAGCT AAAGAAACCA CCACCAACCA AGAGCCAAGA GGGTTATTGA 

415 1 TGTTAGCTGA ATTATCATCA GTGGGATCAT TAGCATATGG AGAATATTCT 

4201 CAAAAAACTG TTGAAATTGC TAAATCCGAT AAGGAATTTG TTATTGGATT 

4251 TATTGCCCAA CGTGATATGG GTGGCCAAGA AGAAGGATTT GATTGGCTTA 

4 301 TTATGACACC TGGAGTTGGA TTAGATGATA AAGGTGATGG ATTAGGACAA 

4 351 CAATATAGAA CTGTTGATGA AGTTGTTAGC ACTGGAACTG ATATTATCAT 

44 01 TGTTGGTAGA GGATTGTTTG GTAAAGGAAG AGATCCAGAT ATTGAAGGTA 

4451 AAAGGTATAG AAATGCTGGT TGGAATGCTT ATTTGAAAAA GACTGGCCAA 

4501 TTATAAATGT GAAGGGGGAG ATTTTCACTT TATTAGATTT GTATATATGT 

4551 AGAATAAATA AATAAATAAG TTAAATAAAT AATTAAATAA GGGTGGTAAT 

4601 TATTACTATT TACAATCAAA GGTGGTCCTT CTAGCTGTAA TCCGGGCAGC 

4651 GCAACGGAAC ATTCATCAGT GTAAAAATGG AATCAATAAA GCCCTGCGCA 

4701 GCGCGCAGGG' TCAGCCTGAA TACGCGTTTA ATGACCAGCA CAGTCGTGAT 

4751 GGCAAGGTCA GAATAGCCCA AGTCGGCCGA GGGGCCTGTA CAGTGAGGGA 

4801 AGATCTGATA TTGACGAAGA GGAACCAATG TAACGTTACA CTGAAGAAAA 

4851 CACATAATAA ACGGGAAGAA ACGGTGTAAA AGTGTGAAAA TAATTTTTGA 

4901 ATATCATTTC CCTTGGTTTA ATTCCAAACG AAACGTGTAT TTTTTTAGAG 

4951 AATGGGAATT CTTATTGGAT GTCTAGATTG TTTGTTTACT CCAGACTGTG 

5001 CACAAAAACG TTTGGATGGA TGATCAGAAG ATATTTTTAG GCTTAGCTCT 

5051 AAATATAAGA AATGATGCTT GAAAATCCAG ACAGAAATTG AGTTTCAAAA 

5101 ATTGGTAATG TGAGGTATTA GTCAACTAAC CAAATAACAA TGCAAACCGG 

5151 TTGATACATT TCATTTTGAA AATAATGAAA CTGGAATTGG ATGACCAGCA 

5201 CACAAACACA TAAAGTAATT ATGGGAATTA GAAGCGAACA TAGAGGAATA 

5251 CTTTGCCACG AACAGAATAC AAGTGGGAAC ACTTTTTTCT CCATTGTTTT 

5301 AGTTCTGTTT TTTTGTCAAA CTGGTTTTGT GCTATGTGTA AAAAAATATT 

5351 GCCAAGAAAA AAAGCTTGTT TTGTGGCCAG TGTCCGAAAA AAATTTTGGG 
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5401 


GAAGCTTCGG 


ATTAATTTAT 


TTTTTTATTC 


CATCGGGGAA 


AGTGGGGGGG 


5451 


AAAAAAAATT 


TAAGCAGTTC 


ATAAAACCTT 


CCAAAAAATA 


TATGGACAGA 


5501 


GATGATTGTA 


TTTTCCCGAC 


ACCAAAATCA 


TAATTAACTA 


TGAGAAAATT 


5551 


GAATGTAACG 


TTACAATTTA 


TTTTTATTTG 


AAGCTGAAAA 


G CGATTTA TG 


5601 


ATTTTTCCGA 


AATGAAAATT 


TxrrrrAGGT 


TTATTTTTTT 


TGTCGGGCAA 


5651 


AGAAAAACTG 


AACAAGGATT 


ATTAAAATTT 


TTGGTGTTTG 


TTTGTGTCTG 


5701 


GAGAATTCAT 


TCCTCTCTCA 


TCTTCACACA 


ATGTTTAGAC 


ATCTGACACG 


5751 


ATTCAAAATA 


GTTrC-GTTTC 


CGGGGTTGGT 


GTTTAGTTTT 


CGTTTTTCGT 


5801 


TTTTTTTGGA 


AAGAATGTTT 


TAGCTCATTG 


GTTTTCTTTC 


TTCATTCAAT 


5851 


AGTTTTGAAA 


GAATTTGCCC 


ACTTGTTATT 


ACAATCATAT 


AAAATTAAAC 


5901 


TTTGATATAA 


AATAGAGTTT 


GAAAGTTTCC 


CAGATCCTTT 


TTGATTTCTT 


5951 


TGTAATTTTT 


TTTTCTCCCA 


CATATACACA 


CATACAAACC 


GATlTTiATA 


6001 


AGAAAGAGTT 


ATACCCTGCA 


GCTCGACCTC 


GAGGGATCCG 


GGCCCTCTAG 


6051 


ATGCGGCCGC 


TAGGCCTCGA 


GGGACTTTTG 


CACCAAAAAT 


AATTTATTTT 


6101 


CCAAAATAAA 


ATTTAAATAA 


ATAAAAATAA 


CTCATAATTT 


AATAAAAATT 


6151 


TCAAAATCTT 


CTAGTGTCCT 


TTCATATGCA 


GTACATTAGC 


CATCAGTCAC 


6201 


TTAAACAGCA 


TCTGCTGGTT 


GAAGAATGCT 


TGAAGCAATT 


GTCCAGTCCC 


6251 


AGAGGCACAG 


GCTAGGAGAT 


CTTCAGTTTC 


GGAGGTAACC 


TGTAAGTCTG 


6301 


TTAATGAAGT 


AAAAGTTCCT 


TAGGATTTCC 


ACTCTGACTA 


TGGTCCAGGC 


6351 


ACAGTGACTG 


TACTCCTTGG 


CCTTCAGGTA 


ATGCAGAATC 


CTCCCATAAT 


6401 


ATCTTTTCAG 


GTGCAGACTG 


CTCATGAGTT 


TTCCCCTGGT 


GAAATCTTCT 


6451 


TTCTCCAGTT 


TTTCTTCCAG 


GACTGTCTTC 


AGATGGTTTA 


TCTGATGATA 


6501 


GACATTAGCC 


AGGAGGTTCT 


CAACAATAGT 


CTCATTCCAG 


CCAGTGCTAG 


6551 


ATGAATCTTG 


TCTGAAAATA 


GCAAAGATGT 


TCTGGAGCAT 


CTCATAGATG 


6601 


GTCAATGCGG 


CGTCCTCCTT 


CTGGAACTGC 


TGCAGCTGCT 


TAATCTCCTC 


6651 


AGGGATGTCA 


AAGTTCATCC 


TGTCCTTGAG 


GCAGTATTCA 


AGCCTCCCAT 


6701 


TCAATTGCCA 


CAGGAGCTTC 


TGACACTGAA 


AATTGCTGCT 


TCTTTGTAGG 
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6751 


AATCCAAGCA 


AGTTGTAGCT 


CATGGAAAGA 


GCTGTAGTGG 


AGAAGCACAA 


6B01 


CAGGAGAGCA 


ATTTGGAGGA 


GACACTTGTT 


GGTCATGTTC 


CTCGAGGCCT 


6851 


TTTTGGCCAG 


CTGGCGCCTG 


CTGCGCGACG 


GCGAGCTGCT 


CACCACCCAG 


6901 


GATCCGTCCC 


CCTTTTCCTT 


TGTCGATATC 


ATGTAATTAG 


TTATGTCACG 


6951 


CTTACATTCA 


CGCCCTCCCC 


CCACATCCGC 


TCTAACCGAA 


AAGGAAGGAG 


7001 


TTAGACAACC 


TGAAGTCTAG 


GTCCCTATTT 


ATTTTTl'TAT 


AGTTATGTTA 


7051 


GTATTAAGAA 


CGTTATTTAT 


ATTTCAAATT 


TTTCTTTTTT 


TTCTGTACAG 


7101 


ACGCGTGTAC 


GCATGTAACA 


TTATACTGAA 


AACCTTGCTT 


GAGAAGGTTT 


7151 


TGGGACGCTC 


GAAGGCTTTA 


ATTTGCA 
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Pgall 
Hirv^ III (6557) 
ApalA (6194) 
£:coRI(6152) 



CaURAS 



>J vol (4945) 




pGAL1PSiST-1 

6621 bp 



ApalA (4218) 



ApalA (3721) 




AMP 



Ec<^ I (276) 

Pjrl(594) 
Pj/I(621) 
fZ/nd III (629) 
^ vol (647) 
Pst I (862) 
hlFN 
3UTR 
^val(1422) 
SflmHI(I446) 
Amfll(1451) 
^vfll(1451) 
5ma I (1453) 
£coRI(1486) 
Amfll(l502) 
^val(1502) 
5/7wiI(1504) 
CYCT 
Cla 1 (2099) 
OR I 
ApalA {:i^l5) 
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1 TTCCATCGGG GAAAGTGGGG GGGAAAAAAT TTTAAGCAGT TCACAAAACC 

51 TTCCAAAAAA TATATGGACA AAGATGATTG TATTTTCCCG ACACCAAAAT 

101 CATAATTAAT TATGAGAAAG TTAAATGTAA CGTTACAATT TATGTTTATT 

151 TGAAGGTGAA AAGCGATTTA TGATTTTTCC GAAATGAAAA TTTTTTTTAG 

201 GTTTATTTTT TTTGTCGGGC AAAGAAAAAC TGAACAAGGA TTATTAAAAT 

251 TTTTGGTGTT TGTTTGTGTC TGGAGAATTC ATTCCTCTCT CATCTTCACA 

301 CAATGTTTAG ACATCTGACA CGATTCATGA TAGTTCGGTT TCCGGGGTTG 

351 GTGTTTAGTT TTCGTTTTTC TTTTTTTTTG GAAAGAATGT TTTAGCTCAT 

4 01 TGGTTTTCtT TCTTCATTCA ATAGTTTTGA AAGAATTTGC CCACTTGTTA 

451 TTACAATCAT ATAAAATTAA ACTTTGATAT AAAATAGAGT TTGAAAGTTT 

501 CCCAGATCCT TTTTGATTTC TTTGTAAATT TTTTTTTCTC CCACAT.aTAC 

551 ACACATACAA ACCGATTTTT ATAAGAAAGA GTTATACCCT GCAGLICGAC 

601 CTCGACTGTT TAAACCTGCA GGCATGCAAG CTTGGCCAAA AAGGCCTCGA 

651 GGAACATGAC CAACAAGTGT CTCCTCCAAA TTGCTCTCCT GTTGTGCTTC 

701 TCCACTACAG CTCTTTCCAT GAGCTACAAC TTGCTTGGAT TCCTACAAAG 

751 AAGCAGCAAT TTTCAGTGTC AGAAGCTCCT GTGGCAATTG AATGGGAGGC 

8 01 TTGAATACTG CCTCAAGGAC AGGATGAACT TTGACATCCC TGAGGAGATT 

851 AAGCAGCTGC AGCAGTTCCA GAAGGAGGAC GCCGCATTGA CCATCTATGA 

901 GATGCTCCAG AACATCTTTG CTATTTTCAG ACAAGATTCA TCTAGCACTG 

951 GCTGGAATGA GACTATTGTT GAGAACCTCC TGGCTAATGT CTATCATCAG 

1001 ATAAACCATC TGAAGACAGT CCTGGAAGAA AAACTGGAGA AAGAAGATTT 

1051 CACCAGGGGA AAACTCATGA GCAGTCTGCA CCTGAAAAGA TATTATGGGA 

1101 GGATTCTGCA TTACCTGAAG GCCAAGGAGT ACAGTCACTG TGCCTGGACC 

1151 ATAGTCAGAG TGGAAATCCT AAGGAACTTT TACTTCATTA ACAGACTTAC 

1201 AGGTTACCTC CGAAACTGAA GATCTCCTAG CCTGTGCCTC TGGGACTGGA 

1251 CAATTGCTTC AAGCATTCTT CAACCAGCAG ATGCTGTTTA AGTGACTGAT 

1301 GGCTAATGTA CTGCATATGA AAGGACACTA GAAGATTTTG AAATTTTTAT 
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1351 


TAAATTATGA 


GTTATTTTTA 


TTTATTTAAA 


TTTTATTTTG 


GAAAATAAAT 


1401 


TA'ITrTTGGT 


GCAAAAGTCC 


CTCGAGGCCT 


AGCGGCCGCC 


TAGAGGATCC 


1451 


CCGGGCGCTA 


GGCGGCCGCT 


AGGCCTTTTT 


GGCCGAATTC 


GAGCTCGGTA 


1501 


CCCGGGGAGA 


TCCGTCCCCC 


TTTTCCTTTG 


TCGATATCAT 


GTAATTAGTT 


1551 


ATGTCACG CT 


TACATTCACG 


CCCTCCCCCC 


ACATCCGCTC 


T AAC CG AAAA 


1601 


GGAAGGAGTT 


AGACAACCTG 


AAGTCTAGGT 


CCCTATTTAT 


TTTTTTATAG 


1651 


TTATGTTAGT 


ATTAAGAACG 


TTATTTATAT 


TTCAAATTTT 




1701 


CTGTACAGAC 


GCGTGTACGC 


ATGTAACATT 


ATACTGAAAA 


CCTTGCTTGA 


1751 


GAAGGTTTTG 


GGACGCTCGA 


AGGCTTTAAT 


TTGCAAGCTA 


GCTTGGCGTA 


laoi 


ATCATGGTCA 


TAGCTGTTTC 


CTGTGTGAAA 


TTGTTATCCG 


CTCACAATTC 


1851 


CACACAACAT 


ACGAGCCGGA 


AGCATAAAGT 


GTAAAGCCTG 


GGGTGCCTAA 


1901 


TGAGTGAGCT 


AACTCACATT 


AATTGCGTTG 


CGCTCACTGC 


CCGCTTTCCA 


1951 


GTCGGGAAAC 


CTGTCGTGCC 


AGAGATCTCT 


GCATTAATGA 


ATCGGCCAAC 


2001 


GCGCGGGGAG 


AGGCGGTTTG 


CGTATTGGGC 


GCTCTTCCGC 


TTCCTCGCTC 


2051 


ACTGACTCGC 


TGCGCTCGGT 

\ 


V. w X XWwwV.>X.w 


CGGCGAGGGG 


TATCAGATCG 


2101 


ATCTCACTCA 


AAGGCGGTAA 


TACGGTTATC 


CACAGAATCA 


GGGGATAACG 


2151 


CAGGAAAGAA 


CATGTGAGCA 


AAAGGCCAGC 


AAAAGGCCAG 


GAACCGTAAA 


2201 


AAGGCCGCGT 


TGCTGGCGTT 


TTTCCATAGG 


CTCCGCCCCC 


CTGACGAGCA 


2251 


TCACAAAAAT 


CGACGCTCAA 


GTCAGAGGTG 


GCGAAACCCG 


ACAGGACTAT 


2301 


AAAGATACCA 


GGCGTTTCCC 


CCTGGAAGCT 


CCCTCGTGCG 


CTCTCCTGTT 


2351 


CCGACCCTGC 


CGCTTACCGG 


ATACCTGTCC 


GCCTTTCTCC 


CTTCGGGAAG 


2401 


CGTGGCGCTT 


TCTCATAGCT 


CACGCTGTAG 


GTATCTCAGT 


TCGGTGTAGG 


2451 


TCGTTCGCTC 


CAAGCTGGGC 


TGTGTGCACG 


AACCCCCCGT 


TCAGCCCGAC 


2501 


CGCTGCGCCT 


TATCCGGTAA 


CTATCGTCTT 


GAGTCCAACC 


CGGTAAGACA 


2551 


CGACTTATCG 


CCACTGGCAG 


CAGCCACTGG 


TAACAGGATT 


AGCAGAGCGA 


2601 


GGTATGTAGG 


CGGTGCTACA 


GAGTTCTTGA 


AGTGGTGGCC 


TAACTACGGC 


2651 


TACACTAGAA 


GGACAGTATT 


TGGTATCTGC 


GCTCTGCTGA 


AGCCAGTTAC 
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2701 


CTTCGGAAAA 


AGAGTTGGTA 


GCTCTTGATC 


CGGCAAACAA 


ACCACCGCTG 


2751 


GTAGCGGTGG 


TTTTTTTGTT 


TGCAAGCAGC 


AGATTACGCG 


CAGAAAAAAA 


2801 


GGATCTCAAG 


AAGATCCTTT 


GATCTTTTCT 


ACGGGGTCTG 


ACGCTCAGTG 


2851 


GAACGAAAAC 


TCACGTTAAG 


GGATTTTGGT 


CATGAGATTA 


TCAAAAAGGA 


2901 


TCTTCACCTA 


GATCCTTTTA 


AATTAAAAAT 


GAAGTTTTAA 


ATCAATCTAA 


2951 


AGTATATATG 


AGTAAACTTG 


GTCTGACAGT 


TACCAATGCT 


TAATCAGTGA 


3001 


GGCACCTATC 


TCAGCGATCT 


GTCTATTTCG 


TTCATCCATA 


GTTGCCTGAC 


3051 


TCCCCGTCGT 


GTAGATAACT 


ACGATACGGG 


AGGGCTTACC 


ATCTGGCCCC 


3101 


AGTGCTGCAA 


TGATACCGCG 


AGACCCACGC 


TCAC GGCTC 


CAGATTTATC 


3151 


AGCAATAAAC 


CAGCCAGCCG 


GAAGGGCCGA 


GCGCAGAAGT 


GGTCCTGCAA 


3201 


CTTTATCCGC 


CTCCATCCAG 


TCTATTAATT 


GTTGCCGGGA 


AGCTAGAGTA 


3251 


AGTAGTTCGC 


CAGTTAATAG 


TTTGCGCAAC 


GTTGTTGCCA 


TTGCTACAGG 


3301 


CATCGTGGTG 


TCACGCTCGT 


CGTTTGGTAT 


GGCTTCATTC 


AGCTCCGGTT 


3351 


CCCAACGATC 


AAGGCGAGTT 


ACATGATCCC 


CCATGTTGTG 


CAAAAAAGCG 


3401 


GTTAGCTCCT 


tCGGTCCTCC 


GATCGTTGTC 


AGAAGTAAGT 


TGGCCGCAGT 


3451 


GTTATCACTC 


ATGGTTATGG 


CAGCACTGCA 


TAATTCTCTT 


ACTGTCATGC 


3501 


CATCCGTAAG 


ATGCTTTTCT 


GTGACTGGTG 


AGTACTCAAC 


CAAGTCATTC 


3551 


TGAGAATAGT 


GTATGCGGCG 


ACCGAGTTGC 


TCTTGCCCGG 


CGTCAATACG 


3601 


GGATAATACC 


GCGCCACATA 


GCAGAACTTT 


AAAAGTGCTC 


ATCATTGGAA 


3651 


AACGTTCTTC 


GGGGCGAAAA 


CTCTCAAGGA 


TCTTACCGCT 


GTTGAGATCC 


3701 


AGTTCGATGT 


AACCCACTCG 


TGCACCCAAC 


TGATCTTCAG 


CATCTTTTAC 


3751 


TTTCACCAGC 


GTTTCTGGGT 


GAGCAAAAAC 


AGGAAGGCAA 


AATGCCGCAA 


3801 


AAAAGGGAAT 


AAGGGCGACA 


CGGAAATGTT 


GAATACTCAT 


ACTCTTCCTT 


3851 


TTTCAATATT 


ATTGAAGCAT 


TTATCAGGGT 


TATTGTCTCA 


TGAGCGGATA 


3901 


CATATTTGAA 


TGTATTTAGA 


AAAATAAACA 


AATAGGGGTT 


CCGCGCACAT 


3951 


TTCCCCGAAA 


AGTGCCACCT 


GACGTCTAAG 


AAACCATTAT 


TATCATGACA 


4001 


TTAACCTATA 


AAAATAGGCG 


TATCACGAGG 


CCCTTTCGTC 


TCGCGCGTTT 
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4051 CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA 

4101 CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG 

4151 TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA 

42 01 GCAGATTGTA CTGAGAGTGC ACCATATCGA CGCTCTCCC7 TATGCGACTC 

4251 CTGCATTAGG AAGCAGCCCA GTAGTAGGTT GAGGCCGTTG AGCACCGCCG 

4 301 CCGCAAGGAA TGGTGCATGC AAGGAGATGG CGCCCAACAG TCCCCCGGCC 

4351 ACGGGGCCTG CCAGCATAGG GACGCCGAAA CAAGCACTAA TAGGAATTGA 

44 01 TTTGGATGGT ATAAACGGAA ACAAAAAAAA GAGCTGGTAC TACTTTCTTT 

4451 AAAATTATTT TATTATTTGA TTTTATTTAA TAGTATATAT TATATTTTGA 

4501 ACGTAGATTA TTTTGTTGAA AGTTGCTGTA GTGCCATTGA TTCGTAACAC 

4551 TAATTCTGTA TTAGTCATTC CTCTTGTTTG ATAGTATCCA AAAAAACGGC 

4 601 TATTTTTTTG CAATCTTATT TCCTGCATAT TATACAGATA ACATAATGAA 

4 651 AGAAAAAATC TTTTTTTTTG TTCTTCAATG ATGATTTCAA CCATTCTTTT 

4701 AAACATTGAT CAATTCCTGA GCAACAACCC CATACACACT GGTTTATATA 

4751 CCGCCCCTTT TACAGTTGAA GAAAGAAATA GAAATAGAAA TAGCAAACAA 

4 801 AAGATATGAC'AGTCAACACT AAGACCTATA GTGAGAGAGC AGAAACTCAT 

4 851 GCCTCACCAG TAGCACAGCG ATTATTTCGA TTAATGGAAC TGAAGAAAAC 

4901 CAATTTATGT GCATCAATTG ACGTTGATAC CACTAAGGAG TTCCTCGAGT 

4951 TAATTGATAA ATTAGGTCCT TATGTATGCT TAATCAAGAC TCATATTGAT 

5001 ATAATCAATG ATTTTTCCTA TGAATCCACT ATTGAACCAT TATTAGAACT 

5051 TTCACGTAAA CATCAATTTA TGATTTTTGA AGATAGAAAA TTTGCTGATA 

5101 TTGGTAATAC CGTAAAGAAA CAATATATTG GTGGAGTTTA TAAAATTAGT 

5151 AGTTGGGCAG ATATTACCAA TGCTCATGGT GTCACTGGGA ATGGAGTGGT 

52 OX TGAAGGATTA AAACAGGGAG CTAAAGAAAC CACCACCAAC CAAGAGCCAA 

5251 GAGGGTTATT GATGTTAGCT GAATTATCAT CAGTGGGATC ATTAGCATAT 

5301 GGAGAATATT CTCAAAAAAC TGTTGAAATT GCTAAATCCG ATAAGGAATT 

5351 TGTTATTGGA TTTATTGCCC AACGTGATAT GGGTGGCCAA GAAGAAGGAT 
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54 01 TTGATTGGCT TATTATGACA CCTGGAGTTG GATTAGATGA TAAAGGTGAT 

54 51 GGATTAGGAC AACAATATAG AACTGTTGAT GAAGTTGTTA GCACTGGAAC 

55 01 TGATATTATC ATTGTTGGTA GAGGATTGTT TGGTAAAGGA AGAGATCCAG 
5551 ATATTGAAGG TAAAAGGTAT AGAAATGCTG GTTGGAATGC TTATTTGAAA 
5601 AAGACTGGCC AATTATAAAT GTGAAGGGGG AGATTTTCAC TTTATTAGAT 
5651 TTGTATATAT GTAGAATAAA TAAATAAATA AGTTAAATAA ATAATTAAAT 
5701 AAGGGTGGTA ATTATTACTA TTTACAATCA AAGGTGGTCC TTCTAGCTGT 
5751 AATCCGGGCA GCGCAACGGA ATTCATCA GTGTAAAAAT GGAATCAATA 
5801 AAGCCCTGCG CTCATGAGCC CGAAGTGGCG AGCCCGATCT TCCCCATCGG 
5851 TGATGTCGGC GATATAGGCG CCAGCAACCG CACCTGTGGC GCCGCAGCGC 
5901 GCAGGGTCAG CCTGAATACG CGTTTAATGA CCAGCACAGT CGTGATGGCA 
5951 AGGTCAGAAT AGCCCAAGTC GGCCGAGGGG CCTGTACAGT GAGGGAAGAT 
6001 CTGATATTGA CGAAGAGGAA CCAATGTAAC GTTACACTGA AGAAAACACA 
6051 CAATAAACGG-GAAGAAACGG TGTAAAAGTG TGAAAATAAT TTTTGAATAT 
6101 CATTTCCCTT-GGTTTAATTC CAAACGAAAC GTGTTTTTTT TAGAGAATGG 
6151 GAA7TCTTAT TGGATGTCTA GATTGTTTGT TTACTCCAGA CTGTGCACAA 
6201 AAACGTTTGG ATGGATGATC AGAAGATATT TTTAGGCTTA GCTCTAAATA 
6251 TAAGAAATGA TGCTTGAAAA ACCAGACAGA AATTGAGTTT CAAAAATTGG 

63 01 TAATGTGAGG TATTAGTCAA CTAACCAAAT AACAATGCAA ACCGGTTGAT 
6351 ACATTTCATT TTGAAAATAA TGAAACTGGA ATTGGATGAC CAGCACACAA 

64 01 ACACATAAAG TAATTATGGG AATTAGAAGC GAACATAGAG GAGTACTTGG 
6451 CCACGAACAG AATACAAGTG GGAACACTAT TTTCTCCATT GTTTTAGTTC 
6501 TGTTTTTTTG TCAGCCTAGT TTTGTGCTAT GTGTAAAAAA TATTGCCAAG 
6551 AAAAAAAGCT TGTTTTGTGG CCAGTGTCCG AAAAAAATTT TGGGGAATCT 
6601 TCGGATTAAT TTATGTTTTC A 
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MITDEQLNTI ALTFGFASII LIIIYHAIST NVHKLEDETP SSSFTRTNTT 
1 50 

ETTVASKKKK 
51 60 



MLKTLTQTLR LTGKAFPKVR PALIRTYAAF DRSKPHVNIG TIGHVDHGKT 
1 50 

TLTAAITKVL AEQGGANFLD YGSIDRAPEE RARGITISTA HVEYETKNRH 
51 100 

YAHVDCPGHA DYIKNMITGA AQMDGAIIW AATDGQMPQT REHLLLARQV 
101 150 

GVQDLWFVN KVDTIDt)PEM LELVEMEMRE LLSTYGFDGD NTPVIMGSAL 
151 200 

MALEDKKPEI GKEAILKLLD AVDEHIPTPS RDLEQPFLLP VEDVFSISGR 
201 250 

GTWTGRVER GVLKKGEEIE IVGGFDKPYK TTVTGIEMFK KELDSAMAGD 
251 300 

NCGVLLRGVK RDEIKRGMVL AKPGTATSHK KFLASLYILT SEEGGRSTPF 
301 350 

GEGYKPQCFF RTNDVTTTFS FPEGEGVDHS QMIMPGDNIE MVGELIKSCP 
351 400 

LEVNQRFNLR EGGKTVGTGL ITRIIE 
401 426 
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MEVTQRTQSQ TQPTQQSPTT QTQTQSKEDQ NRICQLICST GQFGNYDLNI 
1 50 

NDKTIVQGKM TWYFGRDPNS DLQVASSSRI SNKHFQIWLN FNDKSLWIKD 
51 100 

TSTNGTHLNN SRLVKGSNYL LNQGDEIAVG VGRDEDWRF WVFGDKYNP 
101 150 

AKLPDSTNTI KDEGIYKDFI VKNETIGQGA FATVKKAIER STGESYAVKI 
151 200 

INRRKALNTG GGSAMkGVDR ELSILERLNH PNIVALKAFY EDMDNYYIVM 
201 250 

ELVPGGDLMD FVAANGAIGE DATQVITKQI LEGIAYVHNL GISHRDLKPD 
251 300 

NILIMQDDPI LVKITDFGLA KFSDNSTFMK TFCGTLAYVA PEVITGKYGS 
301 350 

SQMESQQKDN YSSLVDIWSL GCLVYVLLTS HLPFNGKNQQ QMFAKIKRGE 
351 ^ 400 

FHEAPLNSYD ISEDGRDFLQ CCLQVNPKLR MTAAEALKHK WLQDLYEEDS 
401 450 

VKSLSLSQSQ SQQSRKIDNG IHIESLSKID EDVMLRPLDS ERNRKSSKQQ 
451 500 

DFKVPKRVIP LSQHPATPLP MSQPKKRPYQ IDPRTNKKVD LEEPSTSKKV 
501 550 

KLSDSWAED YLKLEPLANS LFQETINISK SPFSFGRNDT CDCEIDDDRL 
551 600 

SKLHCVITKE NDSIWLLDKS TNSCLVNNTS VGKGNKVLLR GGEILHLFFD 
601 650 

PLSSQHIGFK WLVDQSSGE HKSQVEVLKQ TSEEMNIIPL ISGLSSISS 
651 699 
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MGTSTSEALK NIKNKQRRQK VFAEIKHEKN KQRHKQRAER AKEERENPEL 

1 50 

REERIAANIP DTIDSKRIYD ETIAAEVEGD DEFQSYFTNL LEEPKILLTT 

51 100 

SANAKKPAYE FADMIMDFLP NVTFIKRKKE YTMQDMAKYC SNRDFTALLV 

101 150 

IMEDKKIO.^IG ITLIMLPEGP TFYFSITSIV DGKRIKGHGK A-GDYLPEIVL 

151 200 

NNFNSRLGKT VGRLFQSIFP HKPELQGRQV ITLHNQRDYI FFRRHRYTFR 

201 250 

NEEKVGLQE GPQFTLKLRRM QKGVRGDWW EHRPDMERDK KKFYL 

251 295 



MNSEKIIEVI lAIFLPPVAV FMKCGATTPL WINLVLCIFI WFPAILHALY 
1 50 

WLKD 
51 
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MTLGFDKFIS KVSTHRRQSE PSILEIAATN SQNKSRRLSM DNGHCYVRES 
1 

50 

TNNHHHLNTV VDNLRQRAGS rSFISHHHNH HQNSHDNYTV DPLTSNGARI 
51 

100 

SRSRSRSKSV GHGEAISPAY FSKNKTKDLV KQETAHIISK KLLNMLQDLD 
101 

150 

LONPIALKTI SOGSESKFCK lyvSNTNNCI YLPAASSTSF TYEDDENGGV 
151 

200 

IIAEDRNDEM PTAVNNNTLS MDSINHSETD FSDSPPPPDL FSKMKSFHSP 
201 

2 .^ ) 

NYLTSKIDSE CPIPHTFAVI VELTKDSLII KDLHFQFQSL TTILWPTGDA 
251 

300 

YNRTHAKEKF TIGNMEWRTS LSDADYVINS SNSNDVKSKN LGPEDLINRT 
301 

350 

REYKLIDIEE PNNSSNSLSD DDMDINNITS PLSTSPTSSS TSTNSTSNSL 
351 

400 

GSDSYKAGLY VFLLPILLPE HIPASIVSXN GSLAHTLSVE CNKYTDKLNR 
401 

450 

KSKVSASYNL PMVRTPPNIG NSIADKPIYV NRIWNDAVHY IITFPRKYVT 
451 

500 

LGCEHMINVK LSPMVKDWI KRIKFNVLER ITYVSKNLSR EYDYDSEDPY 
501 

550 

CIHPVSKENK VRERWSLYE LKTKAKQSSG GHLEAYKQEV MKCPENNLLF 
551 

600 

SCYEVENDNN NGNGNGNGNG MKKVKQKNKD QPMIATPLDI NVSLPFLTTM 
601 

650 

SDSLIMTSAI EEEGSDSPHT SRRGSAVSMT DNNTTPSNNN PLSPFLGAVE 
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651 
700 

TnGASINEIG DHTLFPDSNF RHIEIKHRLQ VTFRISKPDS DNKMHHyEW 
701 

750 



IDTPIVLLSS KCQEDSPPPY SSV 

751 773 



MGEGTPSLGKRHNKSHTLCNRCGRRSFHVQKKTCSSCGYPAAKMRSHNWALKAKRRRTTGTGRMAYLK 
HV 

TRRFKNGFQTGVAKAQTPSA 
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Project : Identification oF novel essentiat genes in C albicans 
Strain no. : 26g 

Freezer location : glycerol stocks box XXIII; C8 
Growth curve(s) (Bioscreen) : 

Date : 05/11/1997 



2.0 Candida strains Jasmine Uiaene (85/11/^7) 
CD [glucose (uninduced) us galactose (induced) 



Lin 



^tan 26 



Sel 
A 

Fig 
1 



B.B 




e:ee 

pArms F irst 
CD : B.39e 
tine: 8:68 



riax 
1.285 
85:45 



Start 
e.398 
8:88 



48:68 
End Lou 
0.584 0.888 
24:88 iliiili 



High 
1.888 



TiPie: 36:08 houjrs 
Slope II Area 11 Det.T 

8 781 8.591 
Sill B 781 11 24:45 

Ctrl-Enter = Beadm 



Plasmid/clone name* : 26g3 
Freezer location : original stocks box 
Identifier (gene name) : 



HTS screen : 



Form generated by : Inge Loonen 

(^)as it can be found in the Candida albicans Access dbase 
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Project : [dcntification of novel essential genes in C albicans 
Strain no. : 322c_cp 

Freezer location : glycerol stocks box XIV; D6 
Growth curve (s) (Bioscreen) : 

Date : 26/05/1998 



C. albicans iiorary screening 2d5cp, 310cp-331cp, 32baf-349af 
(batchS) 26/05/98 
glucose/maltose vs gaiactose/maltose 
Sample 322cp 




Time (h) 



Induced | 
Induced I 
Uninduced 1 
Uninduced I 



Plasmid/cione name^ : 322c_cp 
Freezer location : original stocks box 
Identifier (gene name) : 



HTS screen : 



IFooB geQeratedl by : Inge Lmmm 

(*)as it can be found in the Candida albicans Access dbase 
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Project : Identification of novel essential genes in C albicans 
Strain no. : 325c_af 

Freezer location : glycerol stocks box XIII; G4 

Growth curve(s) (Bioscreen) : Date : 26/05/1998 



C. albicans library screening 285cp, 310cp-331cp, 325af-349af t 
(batchS) 26/05/98 
glucose/maltose vs ga lactose/maltose 

Sample 325cp 1 




Plasmid/cione name* : 325c_af 
Freezer location : original stocks box 
Identifier (gene name) : 

HTS screen : 



Form generated by : inge Loonen 

(^)as it can be found in the Candida albicans Access dbase 
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Project : Identification of novel essential genes in C albicans 

Strain no. : 382cp (FACS. batch 7, Gl) 
Freezer location : glycerol stocks box XVI; A2 

Growth curve(s) (Bioscreen) : Date : 09/06/98 



grucoseymaltose vs galactose/mattose 
382 




■nme (h) j induced 



Plasmid/clone name* : 382cp (purified PGR product) 
Freezer location : original stocks box VIII; AAH8 
Identifier (gene name) : OST4 

HTS screen : 



Form generated by : Inge Loonen 

( *) as ii can be found in the Candida albicans Access dbase 
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Proiect : Identification of novel essential genes in C albicans 



Strain no. : 392cp (FACS, batch 7, Gl) 
Freezer location : glycerol stocks box XVI; B3 
Growth curve(s) (Bioscreen) : 



Date : 09/06/98 



I 



090698RH, Batch?, gr.1 No. 355-401 
glucoseymattose vs galactose/maftose 
392 




- Induced! 

- Induced 

- Induced 

- Induced 



Plasmid/done name'^ : 392cp (purified PGR product) 
Freezer location : original stocks box VIII; AAH2 
Identifier (gene name) : TUFl 

HTS screen : 



Fo&rm generated hy : Inge Loonen 

f as it can be found in the Candida albicans Access dbase 
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Project : Identification of novel essential genes in C. albicans 
Strain no. : 409c_cp 

Freezer location : glycerol stocks box XVI; C9 
Growth curve(s) (Bioscreen) : 

Date : 19/06/1998 



C. albicans library screening experiment 19/06/98 (FACS screening, 

batch 7) 

glucose/maltose vs galactose/maltose 
sample 409cp G2 (large colony, non-transparent) 




^ <o <V T,^ 4* t>> # # A*** «P 



Tima (h) | kiduced j ! 



Plasmid/clone name*^ : 409c_cp 
Freezer location : original stocks box 
Identifier (gene name) : 

HTS screen : 



IFonra generated by : lege Loooen 

(*)as it can be found in the Candida albicans Access dbase 
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Project : Identification of novel essential genes in C. albicans 

Strain no. : 417c_cpG2 

Freezer location : glycerol stocks box XVI; D8 

Growth curve(s) (Bioscreen) : 



Date : 19/06/1998 



C. albicans library screening experiment 19/06/98 (FACS screening, 

batch 7) 

glucose/maltose vsga lactose/maltose 
sample 417cp G2 (small colony, non-transparent) 



I ^ 




Time (h) 



- Uninduced 
-Uninduced 

- Induced 

- hduced 



Plasmid/clone name^ : 417c_cpG2L 
Freezer location : original stocks box 
Identifier (gene name) : 



HTS screen : 



Form generated by : Inge Looneini 

(*)asit can be found in the Candida albicans Access dbase 
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Project I Identification of novel essential genes in C albicans 
Identifier (gene name) : 325caf 

Disruptant strain : Host strain : 

Freezer location : 

Disruption piasmid name* : 
Freezer location : 

Knock*out (single/double): 
Lab book ref. : 

Southern results : 



PCR results : 
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Growth curve(s) : 
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HTS screen : 

Bioscreen test of 325caf knock-out and WT growth in presence of hygromydn B 

diliBtioEis prepared 

Stork solution of 53 mg/ml was prepared for hygromycin B. 
From this solution dilutions were prepared of: 
4000Mg/inl. SOOOpg/ml, 2000Mg/ml and lOOOMg/ml 



Growth curves for 325cafgK knock-out and WT in the presence of hygromycin B 
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He|p=Fl,P2 ■ 51X51.1) Sariple 51 

2iff ^Hygronycin B test 83/11/1999 
OD double knock-out 325caf UH cl.l 

Lin 



=B31199il 




■Color"-.' 
Js&r^tB 

^rfext ' 



|:^S:i"bp« ftipeiL^: | •^l6tV:T4■rf 
i t3?: I 16:35 " 



H&lp=Pl;F2 ■ '56 056.1-)" Samp le -5fr - - 
2.Bi Hysrrowycin B test B3/11/1999 
OS .double knock-out 325caf UH cl .6 

iin : 



e3119911> 




ia;/iGS;-.S:<?ti;^^^?li:;^^^ 

';{^12:i8 ;'r" '^t — ^ 



ilP^pess^EntOT.^to edit "teytf^ 




y;;^35; 



SUBSTITUTE SHEET (RULE 26) 



wo 00/34481 



PCT/EP99/09833 



2.B Hygronycin B test 63/11/1999 
OB .CAI4 
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Legend: 

1: S/2% gal/2% mal medium containing 0 ^ig/ml Hygromycin B 
2: S/2% gal/2% mal medium containing 1000 fig/ml Hygromycin B 
3: S/2% gal/2% mal medium containing 2000 jig/ml Hygromycin B 
4: S/2% gal/2% mal medium cooiaining 3000 ng/ml Hygromycin B 
5: S/2% gal/2% mal medium containing 4000 fig/ml Hygromycin B 
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Protect : Identification of novel essential genes in C albicans 



Identifier (gene name) : 
Disruptant strain : 
Host strain : 
Freezer location : 

Disruption piasmid name* ; 
Freezer location : 



322c_cp 

322c_cp (in progress) 
CAI4NG 

Knockout strain, box. 



pos. 



322c_cpURAcass.(inv)/pCR2.1(inv) 



Knock-out (single/double): 
Lab book ref. : 



single (in progress) 

Labbook 104 of Ronald de Hoogt 



Southern results 



PCR results : 
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Project : Identification of novet essential genes in C. albicans 

Identifier (gene name) : 417c_cpG2 

Disrupunt strain : Host strain : 

Freezer location : 

Disruption plasmid name* : 
Freezer location : 

Knock-out (single/double): 
Lab book ref. : 

Southern results : 



PCR results : 
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Growth curve(s) : 
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HTS screen : 

Bioscreen test of 417c_cp knock-out and WT growth in presence of hygromycin B 

Stock solution of 53 mg/ml was prepared for hygromycm B. 
From this solution dilutions were prepared of: 
4000^ig/ml, 3000Hg/ml. 2000^g/ml and 1000^g/^ll 

Growth curves for 417c_cp knock-out and WT in the presence of hygromycin B 
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2.B, Hygronycin B test 83/11/1999 
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Heli>^l^I!2. . 51(81. mSanpU B^ ... B31199il 

2^8 .Hygronycin B test 63/11/1999 djix:*!^iz 
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Legend: 

1 : S/2% gal/2% mal medium containing 0 ^g/ml Hygromycin B 
2; S/2% gal/2% mal medium containing 1000 jig/ml Hygromycin B 
3: S/2% gal/2% mal medium containing 2000 fig/ml Hygromycin B 
4: S/2% gal/2% mal medium containing 3000 jig/ml Hygromycin B 
5: S/2% gal/2% mal medium containing 4000 jig/ml Hygromycin B 
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Project : Identification of novel essential genes in C albicans 

Identifier (gene name) : TUFl 
Disruptant strain : TUFISAKO 7 
Freezer location : Strain collection Roland Conireras. YA132 



Host strain : CAI4/NG 



Disruption plasmid name* : Not applicable: short terminal homology (STH) PCR with 
overlapping fragments (split inarker). 



MLG80 


acacaaccaagEgaacLcacccacaccacccactgcggaaacicccgaacGTGCTGGAATrCGCCui.TTATG 


ML081 


tcacctacacacacccccttcctctctatcactcacagcgcacactctgtCCGGCTCGTATGTTGTG 
TGG 


ML094 


CCAGTGCTAACAACTTCATCAACAGTT 


ML092 


GCCTCACCAGTAGCACAACG 



sequences are 50 nl upisiream (MIjOSO), resp. downstream (ML081) of the target ORF. 

MU081 (77.t%) 



MLj080{7S^) 



\ 



TUFl 



Amplification of STH fragments for TUFl 
(ML080/ML094 => TUFl jSTH-STJRAS and 
ML081/ML092 => TUF1_STH-3*URA3). 
TUFl.STH-STJRAB has a 50 bp terminal homology 
region upstream of the TUFl ORF and a 3' 
incomplete URA3 marker, while TUFl.STH- 
3TJRA3 has a 50 bp terminal homology region 
downstream of the TUFl ORF and a 5' incomplete 
URA3 marker. In vivo, only an intact URAblasi 
cassette can be formed when recombination occurs 
between the overlapping truncated URA3 sequences 
of the respective STH fragments. 




M10>J 111 

HISG tptrtitl) 
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Southern results: s^/^^^A^/Ttvc^^z? /J 

Presentation of disrupted allele 



lid 111(13914) HlSC(parUal) 
W.>l I!in34Jfi) \ SENSE PBM\ 

\ M 



(100.0%) 
URA3 

(100.0%) 

HtSG (partial) 

Hift 111(17697) 




FragmentofTUFI SAKO 





Hmm digest 
URA3 probe 
Expected band: 3783 bp 



PCR results: 

> PCR analysis was performed using the primer combination ML091/ML093 (see figure), 
amplifying a URA3 fragment.. Band of 755 bp points to correct homolc^ous 
recombination of URA3 overlapping fragments. TUFl SAKO 7 is clearly positive 
(SAKO stands for single allele knock out). 




lane 1: \Pst\ 

lane 2 to 9: potential TUFl S AKOs 1 to 8 
lane 10: Uracass shoit/pUCl9 (positive 
control) 
lane 1 1 : water 
lane 12: CAM 
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> To check correct integraUon into the genome, PCR was performed with primer sets 
ML090/ML097. For TUFl SAKO 7 a dear signal was obtained of the correct length of 
1825 bp. ^ 



X 12345678 



a: X Psti 

1 to 8: potential TUFl SAKOs i to 8 



Northern analysis: 

88% inhibition 



TUF1 SAKO 7 




TUF1 



ACT1 



Growth curve(s) 
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HTS screen : 
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Bioscreen test of TUFl knock-out clone 7 and WT growth in presence of hygromycin B 
dilutions prepared 

Stock solution of 53 mg/ml was prepared for hygromycin B. 
From this solution dilutions were prepared of: 
4000^g/ml. 3000^g/ml. 2000^g/ml and lOOO^g/ml 



Growth curves for TUFl knock-out and WT in the presence of hygromycin B 
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Legend: 
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I: S/2% gai/2^ mal medium containing 0 (ig/nil Hygromycin B 
2: S/2% gal/2% mal medium containing 1000 ^g/ml Hygromycin B 
3: S/2% gal/2% mal medium containing 2000 ^g/ml Hygromycin B 
4: S/2% gal/2% mal medium containing 3000 jig/ml Hygromycin B 
5: S/2% gal/2% mai medium containing 4000 ng/ml Hygromycin B 
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SEQUENCE LISTING 

<110> Janssen Pharmaceutica N. V. 

<120> Drug Targets in Candida Albicans 

<130> 53731/000 

<140> 
<141> 

<150> 982204122.0 
<151> 1998-12-04 

<160> 16 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 438 
<212> DNA 

<213> Candida al*jicans 
<400> 1 

aacagctggt cttctgctaa tacattcaac cctttccata tctatactcc aacaatatga 60 
taactgatga acaattgaat accattgcat tgacatttgg ttttgcttca ataatattaa 120 
tcataatata tcatgccata tctactaatg tacataaatt agaagatgaa accccatcat 180 
cttcatttac cagaacaaat actactgaaa ctactgttgc aagtaagaaa aagaagtaat 240 
aactgatgga tttttcttcc taccaccaat tgaataatgc tagacttgtt ggtgtgctac 300 
aaatatttca aaagaaaata cgaat^acttt ataaaatggt aagaacggaa gatggtttct 360 
catttataca ctaaatacaa atcaqatdca catacacaaa cacaaataca tacatacacc 420 
tatatccctt tatttgat 438 

<210> 2 
<211> 1380 
<212> DNA 

<213> Candida albicans 
<400> 2 

atgttaaaaa cactaactca aactttacgc ttaactggga aagctttccc aaaggtccgt 60 
ccggccttga tcagaaccta cgctgccttc gaccgttcta aacctcatct caacattggt 120 
actattggtc atgttgatca tggtaaaact acattgactg ctgctatcac caaagtttta 180 
gccgaacaag gtggtgccaa crtcttggat tatggttcta ttgatagagc tccagaagaa 240 
agagctagag gtatcactat ttccactgcc cacgttgaat acgaaaccaa gaacagacac 300 
tatgcccacg ttgattgtcc aggacacgct gattatatca aaaatatgat tactggtgcc 360 
gctcaaatgg atggtgcrat cartgttgtt gctgccactg acggtcaaat gcctcaaacc 420 
agagaacatt tgttattggc cagacaagtt ggtgttcaag acctggttgt gtttgtcaac 480 
aaagtcgata ctattgatga ccctgaaatg ttggaattag tcgaaatgga aatgagagaa 540 



1 
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ttgttatcca cctacggtt:: tgatggtgac aacactccag ttattatggg atctgcttta 600 

atggctttgg aagacaagaa accagaaatt ggtaaggaag ctatcttgaa attgttagat 660 

gctgtcgatg aacacattcc aactccatca agagacttgg aacaaccatt tttgttacca 720 

gttgaagacg tgttctccat ctccggtaga ggaactgttg tcactggtag agttgaaaga 780 

ggtgttttga agaagggtca agaaatcgaa attgttggtg gttttgacaa accttacaag 840 

actactgtta ccggtattca aatgttcaaa aaagaattag accctgctat ggctggtgac 900 

aactgtggtg ttttgttaag aggtgttaaa agagatgaaa tcaagagagg tatgg^tttg 960 

gccaaaccag gtactgctac ttctcacaag aagttcttgg cttccttgta tattttgact 1020 

tccgaagaag gtggtcgttc cactccattt ggtgaaggtt acaagcctca atgcttcttc 1080 

agaactaacg atgtcactac cacattttca ttcccagaag gagaaggtgt tgatcattct 1140 

caaatgatca tgccaggrga caacattgaa atggttggtg aattgatcaa atcttgtcca 1200 

ttagaagtca accaacgttt caacttgaga gaaggtggta aaactgttgg tactggtttg 1260 

attaccagaa tcatcgaata aacagaatgt gcactgtgaa taataaaaag aaaagaggta 1320 

tatataggtg actttgtatt ttgtattgaa caataaaatt ctgtaaatag taagggcctc 1380 

<210> 3 
<211> 2283 

<212> DNA " 
<213> Candida albicans 



<400> 3 

gaattcgccc ttaagcactc gtttcaacta 
aaactacatt aatggaagta acacaacgga 
caccgacaac tcagacgcaa acccaaagca 
tttgctccac gggtcagttt ggcaattatg 
aaggtaaaat gacgtggtaz tttggaagag 
cgtcgagaat ttcaaacaag cattttcaaa 
ggataaagga cacttcaact aacgggacac 
caaactacct tcttaatcag ggtgatgaaa 
ttgtgaggtt tgtcgttg-c cttg^tgaca 
ccaacacaat taaagatgaa ggaatataca 
gccaaggagc atttgccact gtgaaaaagg 
cggtgaagat tataaatcga agaaaagcat 
gagtggaccg tgaattgtcc atattagagc 
aagcttttta tgaagatatg gacaattact 
atttgatgga ctttgtggct gcaaacggtg 
cgaaacagat tctagaagga attgcctatg 
tgaagccaga taatattitg attatgcaag 
ttggattggc aaaattcacr gacaatctga 
cgtatgttgc tcccgaactt atcaccggta 
aaaaggacaa ctactcttcc ttggttgaca 
ttttaacttc tcatttacca ttcaacggga 
aaaggggcga atttcatgag gctccattaa 
atttcttgca gtgctgccta caggttaatc 
tgaaacataa atggttgcaa gacttgtatg 
cgcaatcaca gtcgcaacaa tctcgaaaga 
gcaaaattga tgaagacgt- acgcttcgtc 
gtaaacagca agatttcaag gcacccaagc 



tacattcagt aacaacaccc ttaatttar - 60 
cgcagagtca gacacaacca acacaacagt 120' 
aagaggacca gaataggatt tgtcaattga 180 
atttgaatat caacgataaa actatcgtac 240 
accccaactc agatttgcaa gtggcgtcgt 300 
tctggctcaa cttcaatgat aaatcactat 360 
accttaacaa cagtcgattg gtgaaaggat 420 
tagcagtagg ggttggtaga gacgaggacg 480 
aatacaaccc ggcaaagcta cctgattcga 540 
aagactttat tgtgaaaaat gaaacgacag 600 
cgattgaacg atctacgggc gagccgtacg 660 
taaataccgg tggtggaagt gccatggcag 720 
ggctcaacca cccaaatata gttgctctaa 7 80 
atattgtgat ggaattggtg ccgggcggtg 840 
caataggaga agacgcaaca caagtgatca 900 
ttcataattt aggaatctcc catcgtgatt 960 
atgacccaat acttgttaaa atcaccgact 1020 
cgtttatgaa aactttttgt ggtacattgg 1080 
agtatggatc atcgcagatg gaacrgcaac 1140 
tttggtcttt gggatgtttg gtttatgtac 1200 
aaaaccagca acaaatgttt gccaagatca 1260 
attcatacga catttctgaa gacggaagag 1320 
ctaaactaag gatgacggct gctgaagctt 1380 
aagaggattc tgtcaaatca ttgagtttat 1440 
tagataatgg tatccatatc gaatcattga 1500 
cattggatag cgaaagaaat aggaaatcaa 1560 
gtgtgattcc gttatctcaa catcctgcaa 1620 
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caccgttacc aatgtcacaa ccgaaaaaga ggccgtatca aatagaccct agaacaaaca 1680 

aaaaagtcga tttggaagaa cctctgacaa gcaagaaagt caagctaagt gattccgttg 1740 

ttgcggaaga ctacttgaag ttggggccac ttgcaaattc gttattccaa gaaacaataa 1800 

atatttcaaa gtccccgttt tctttcggaa gaaatgacac ttgtgattgc gagatagacg 1860 

acgacagact atccaaactt cattgtgtca ttaccaaaga aaacgactct atatggttat 1920 

tggataagag tactaactcg tgcttggtca acaatactag tgttggaaaa ggcaacaaag 1980 

ttttgcttag aggaggggag atattacatc tcttctttga cccattgtca ctgcaacata 2040 

taggtttcaa agtagtcctt gttgatcaac tgtctggtga acataagagt caagtggagg 2100 

ttttgaaaca aacctcagaa gaaatgaata ttattccact tatttctggt ttaagtagta 2160 

taagttcata gatttagcat atatacaagc atttcctata gaaacaaagg ttcattaatt 2220 

tagttattta cctccatgca attacattta cttcttcttc caagggcgaa ttctgcagat 2280 

ate 2283 



<210> 4 
<211> 826 
<212> DMA 

<213> Candida albicans 
<400> 4 

atgggtagta tgtgaagata caatattgaa 
gcc atggac atttttggat ttgataatta 
gaciagaatca ccatagttgc aagatttgat 
tgtaacatct cttaaagtaa gaagaatatg 
ggaacttgtt gaagatatta aaatagaatg 
aacctcacaa tcaaataaaa acaacagtaa 
gggtactcca tctttaggta aacgtcacaa 
ccgtcgttca ttccacgtcc aaaagaagac 
aatgagatct cacaactggg ccttaaaagc 
aatggcttac ttgaaacacg ttaccagaag 
taaagctcaa accccttccg cttaaactaa 
ttattcatta aagtcatgtt aagcatagca 
atgtaatgac tatttaatat ctgtttaaat 
acaccaaaaa aaaaagaaac aaataaaatc 



agtgtttact agaatatcta agatgtttga 60 
aaaaaagtag caatagatta ttgcgttgga 120 
agatgttaaa atgttcacgc aggcgaaaga 180 
gacatgaata aaaatagata gcactatttt 240 
ggatttcaac atagatattc aaagtaacga 300 
tactaacaat tcaattttta tttttataga 360 
caaatctcac accttatgta acagatgtgg 420 
ctgttcttct tgtggttacc cagctgctaa 480 
caaaagaaga agaactactg gtaccggtag 540 
attcaagaac ggtttccaaa ctggtgttgc 600 
ttactgaagt tattggtcat gcattagtca 660 
aaggaagaat tggttagatt cttgtttaaa 720 
aagaggttta gtctttattt ttttacgtat 780 
tgtatattaa tgttgg 826 



<210> 5 
<211> 978 
<212> DNA 

<213> Candida albicans 



<400> 5 

atgggtacta gtacaagtga agcattgaag 
gtttttgcag aaacaaaaca cgaaaagaat 
gctaaggaag aaagagaaaa ccccgaatta 
gatactatag atagcaaacg tatttatgat 
gacgagtttc agtcatattt cactaacttg 
agtgccaatg ctaaaaaacc ggcctatgaa 
aatgtgacat ttatcaaaag gaagaaggaa 
tcgaatagag actccactgc attgcttgtc 



aacatcaaaa acaaacagcg aagacagaaa 60 
aaacaacgtc ataagcaaag agccgaaaga 120 
agagaggaaa gaatagcagc taatatccca 180 
gagactatag ctgctgaagt tgaaggagat 240 
ttggaagaac caaagatttt gttgacaaca 300 
tttgcagaca tgatcatgga ctttttaccg 360 
tatacaatgc aagatatggc caaatattgc 420 
atcaacgaag acaagaagaa ggtcaatggt 480 
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ataacgctca tcaatttacc tgaagggcca acattttatt tttcgattac atcaatagtt 540 

gatgggaaaa gaattaaggg acacgggaaa gctggtgatt atttacctga gattgtattg 600 

aataatttca attcaagatt gggtaaaact gtgggaagac tatttcaaag tattttccct 660 

cataaacctg aacttcaagg aagacaagtg attactttgc acaatcaacg tgattatatt 720 

tttttcagaa gacatagata tattttcaga aatgaggaaa aggttggatt gcaggaattg 780 

ggtccgcagt ttacattaaa gctaagaaga atgcaaaagg gagtacgtgg tgatgttgtt 840 

tgggaacaca gaccagatat ggaaagagat aagaagaagt tttatttata agcgggtgta 900 

taaaggtagt agtagtgcgt ttataagtat gtgtgtgtgt ttatgcatag atgtgtaaag 960 

agtaatacag ctaattcg 978 

<210> 6 
<211> 619 
<212> DNA 

<213> Candida albicans 
<400> 6 

aactaatttg tttaaacatc aataccaaga agtttttaca attcaatccc acatacacca 60 

ttaattatga atrctgaaaa gattattgaa gttatcattg ccattttctt accaccagta 120 

gctgtgttta tgaaatgtgg tgccactacc ccattatgga ttaacttggt attatgtatc ISO 

tttatttggt tccctgctat cttacatgcc ttatacgttg tgttgaaaga traaacaaac 240 

accagagatt tacrgcttga tgaattgatt actccaaaga gttgtgacta gttcccagtg 300 

tgtttttttt gccttccaac tttcttttac atttttccat tactac .ct gtcttccccc 360 

ctattttgca gagttttcaa aatttatcca aaacatgtta gtcattaaac catattatta 420 

taattattcr tttttgtatt tttttccctt aaaacacgtt aatttattaa tcgtttcgtt 480 

gtttggtatt ttattttttt gtatttatca attggaatat atatctatac atgaatttat 540 

tatccattgt accaattgtt aaaacatttt gttagttttt tgttactagt ataaaannat 600 
aataaaagtt tanttcaac 619 

<210> 7 
<211> 2319 
<212> DNA 

<213> Candida albicans 
<400> 7 

atgacattag ggttcgataa attcataagc aaggtcagca ctcatagacg tcaatctgaa 60 

ccatcaatct tggaaatcgc agccaccaat tctcaaaata aatcgagaag gctaagtatg 120 

gataatggtc attgttatgt tcgtgaatca actaataatc atcatcattt aaataccgtc 180 

gttgataatt tacgacagcg tgcgggatcg ttttcattta tttcacatca ccataatcac 240 

catcagaata gtcacgataa ttatactgtc gatcccctta catcaaacgg agcacgaatt 300 

tcccgatcac gttcacgttc caaatcagtt gggcacggag aagcaatatc accagcgtat 360 

ttttccaaga ataaaaccaa agatttagtg aaacaggaaa cagcacatat cattctgaag 420 

aaattactca acatgttaca agatttggat ttacaaaacc ctattgcatt gaaaacaata 480 

tcacaaggtt cagaatcaaa gttttgtaaa atctacgtgt ctaacactaa taattgtatt 540 

tacttaccag cagcaagttc aacaagtttc acttatgaag atgatgaaaa tggcggcgtt 600 

ataattgctg aagatagaaa igatgaaatg ccaacagcag ttaataacaa tactttgtca 660 

atggatagta taaatcattc agagactgat ttcctggatt ctccaccacc tccagattta 720 

ttttctaaaa tgaaatcatt ccattcacca aattacttga cttcaaaaat cgattctgaa 780 

tgcccaattc cacatacati tgctgtgatt gttgaattaa ccaaggactc tttgattatt 840 
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aaagatcttc atttccaatt tcagtcatta actaccattt tatggccaac tggggatgca 900 

tataatcgga ctcatgccaa ggagaaattt accattggga atatggaatg gcgtacatct 960 

ttaagcgacg ccgactatta tatcaatagt tctaattcca acgatgttaa gctgaaaaac 1020 

ttgggtcctg aagatcttat taatcgaact agagaataca aattaatcga tattgaagaa 1080 

ccaaacaatt catcaaacac tttactggat gatgacatgg atattaataa tattacgtcg 1140 

ccattatcaa cgtcaccaac atcaagttca acttcaacaa attcaacctc caactcattg 1200 

ggttcagatt catataaagc tggtctttat gtatttttat taccaatctt attgccagaa 1260 

catattcctg cttccattct ttctattaat ggttcattgg ctcatacatt actggttgaa 1320 

tgcaataaat atactgataa gttgaatcgg aaatcaaaag tatcagcatc gtacaattta 1380 

cctatggtcc gtactccacc aaacattggt aattccattg ctgataagcc aatttatgtt 1440 

aataggattt ggaatgatgc cgtacattat attataactt tcccccgcaa atatgttact 1500 

ttgggttgtg aacacatgat aaatgtgaaa ttactgccca tggtgaaaga tgtggttatc 1560 

aagcgtatta aatttaatgt attggagaga ataacttatg tttccaaaaa tttatcacga 1620 

gaatatgatt atgatagtga agacccctat tgtattcatc cagtttctaa agaaaataaa 1680 

gtacgtgaac gtgttgtgtc gttatatgaa ttgaaaacga aggcaaaaca-- atcttctggt 1740 

ggacatcttg aagcttataa acaagaagtt atgaaatgtc cggaaaataa ccttttattt 1800 

tcttgttatg aggttgaaaa tgataataat aacggcaacg gcaacggcaa cggcaacgga 1860 

aacaagaacg ttaaacaaaa gaataaagat caaccaatga ttgctacacc tttagatatc 1920 

aatgtttctt taccattttt aactactatg tctgatagtt taattatgac atcagccata 1980 

gaagaagaag gttcagatcz gcctcataca tcaagaagag ggtcggcagt gagtatgact 2040 

gataataata ctaccccaac taacaataac cctttatctc catttttggg agcagtggaa 2100 

actaatggtg ctagtataaa tgaaattggt gatcatacat tattccctga ttctaatttt 2160 

cgacatattg aaattaaacc -ccgattacaa gttacattta ggatttctaa accggatctg 2220 

gataataaaa tgcatcatta rgaagtggtt attgataccc ccatcgtttt acttagttca 2280 

aaatgtcaag aagattctcc tcctccttat agttctgta 2319 



<210> 8 
<211> 255 
<212> DNA 

<213> Candida albicans 



<400> 8 

aacgttcgtg caaaaggcta cactggtgat atccacgcag atgaagagca agtttaatca 60 

actctttgtc aattaatgcr gtacttgttt tcattttatt tgctggcatt taaagaatac 120 

ccatagttca gaaaataaaa ttgaaaaatt taaaaaaaaa cgcaatatca ttcatttttt 180 

ttgttttttt gacaataata ttaatatgta gttaccaatg tttttagatt ttatatgttt 240 
tgaaaaaata gtttg 255 



<210> 9 
<211> 119 
<212> DNA 

<213> Candida albicans 



<400> 9 

aaccttacaa tcattatacc aactatcaaa atcataagac tcttnaactt ctgtttttga 60 
tagttggtat aatgatttat gtattatctt aattcattat tattagtttc ggtcacaaa 119 



<210> 10 
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<211> 60 
<212> PRT 

<213> Candida albicans 
<400> 10 

Met lie Thr Asp Glu Gin Leu Asn Thr lie Ala Leu Thr Phe Gly Phe 
15 10 15^--^ 

Ala Ser He He Leu He He He Tyr His Ala He Ser Thr Asn Val 
20 25 30 

His Lys Leu Glu Asp Glu Thr Pro Ser Ser Ser Phe Thr Arg Thr Asn 
35 40 45 

Thr Thr Glu Thr Thr Val Ala Ser Lys Lys Lys Lys 
50 55 60 



<210> 11 
<211> 426 
<212> PRT 

<213> Candida albicans 
<400> 11 

Met Leu Lys Thr Leu Thr Gin Thr Leu Arg Leu Thr Gly Lys Ala Phe 
15 10 15 

Pro Lys Val Arg Pro Ala Leu He Arg Thr Tyr Ala Ala Phe Asp Arg 
20 25 30 

Ser Lys Pro His Val Asn He Gly Thr He Gly His Val Asp His Gly 
35 ' 40 45 

Lys Thr Thr Leu Thr Ala Ala He Thr Lys Val Leu Ala Glu Gin Gly 
■ 50 55 60 

Gly Ala Asn Phe Leu Asp Tyr Gly Ser He Asp Arg Ala Pro Glu Glu 
65 70 75 80 

Arg Ala Arg Gly He Thr He Ser Thr Ala His Val Glu Tyr Glu Thr 
85 90 95 

Lys Asn Arg His Tyr Ala His Val Asp Cys Pro Gly His Ala Asp Tyr 
100 105 110 

He Lys Asn Met He Thr Gly Ala Ala Gin Met Asp Gly Ala He He 
115 120 125 
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Val Val Ala Ala Thr Asp Giy Gin Met Pro Gin Thr Arg Glu His Leu 
130 135 140 

Leu Leu Ala Arg Gin Val Gly Val Gin Asp Leu Val Val Phe Val Asn 
145 150 155 160 

Lys Val Asp Thr He Asp Asp Pro Glu Met Leu Glu Leu Val Glu^et 
165 170 175 

Glu Met Arg Glu Leu Leu Ser Thr Tyr Gly Phe Asp Gly Asp Asn Thr 
180 185 190 

Pro Val He Met Gly Ser Ala Leu Met Ala Leu Glu Asp Lys Lys Pro 
195 200 205 

Glu He Gly Lys Glu Ala He Leu Lys Leu Leu Asp Ala Val Asp Glu 
210 215 220 

His He Pro Thr Pro Ser Arg Asp Leu Glu Gin Pro Phe Leu Leu Pro 
225 230 235 240 

Val Glu Asp Val Phe Ser He Ser Gly Arg Gly Thr Val Val Thr Gly 
245 250 255 

Arg Val Glu Arg Gly Val Leu Lys Lys Gly Glu Glu He Glu He Val 
260 265 270 

Gly Gly Phe Asp Lys Pro Tyr Lys Thr Thr Val Thr Gly He Glu Met 
275 280 285 

Phe Lys Lys Glu Leu Asp Ser Ala Met Ala Giy Asp Asn Cys Gly Val 
290 295 300 

Leu Leu Arg Gly Val Lys Arg Asp Glu He Lys Arg Giy Met Val Leu 
305 310 315 320 

Ala Lys Pro Gly Thr Ala Thr Ser His Lys Lys Phe Leu Ala Ser Leu 
325 330 335 

Tyr He Leu Thr Ser Glu Glu Gly Gly Arg Ser Thr Pro Phe Gly Glu 
340 345 350 

Gly Tyr Lys Pro Gin Cys Phe Phe Arg Thr Asn Asp Val Thr Thr Thr 
355 360 365 

Phe Ser Phe Pro Glu Gly Glu Gly Val Asp His Ser Gin Met He Met 
370 375 380 
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Pro Gly Asp Asn lie Glu Met Val 
385 390 

Leu Glu Val Asn Gin Arg Phe Asn 
405 

Gly Thr Gly Leu lie Thr Arg lie 
420 



Gly Glu Leu lie Lys Ser Cys Pro 
395 400 

Leu Arg Glu Gly Gly Lys Thr Val 
410 415 

He Glu 
425 



<210> 12 
<211> 699 
<212> PRT 

<213> Candida albicans 
<400> 12 

Met Glu Val Thr Gin Arg Thr Gin Ser Gin Thr Gin Pro Thr Gin Gin 
1 5 10 15 

Ser Pro Thr Thr Gin Thr Gin Thr Gin Ser Lys Glu Asp Gin Asn Arg 
20 25 30 

He Cys Gin Leu He Cys Ser Thr Gly Gin Phe Gly Asn Tyr Asp Leu 
35 40 45 

Asn He Asn Asp Lys Thr He Val Gin Gly Lys Met Thr Trp Tyr Phe 
50 55 60 

Gly Arg Asp Pro Asn Ser Asp Leu Gin Val Ala Ser Ser Ser Arg He 
65 70 . 75 80 

Ser Asn Lys His Phe Gin He Trp Leu Asn Phe Asn Asp Lys Ser Leu 
85 90 95 

Trp He Lys Asp Thr Ser Thr Asn Gly Thr His Leu Asn Asn Ser Arg 
100 105 110 

Leu Val Lys Gly Ser Asn Tyr Leu Leu Asn Gin Gly Asp Glu He Ala 
115 120 125 

Val Gly Val Gly Arg Asp Glu Asp Val Val Arg Phe Val Val Val Phe 
130 135 140 

Gly Asp Lys Tyr Asn Pro Ala Lys Leu Pro Asp Ser Thr Asn Thr He 
145 150 155 160 

Lys Asp Glu Gly He Tyr Lys Asp Phe He Val Lys Asn Glu Thr He 
165 170 175 
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Gly Gin Gly Ala Phe Ala Thr Val Lys Lys Ala He Glu Arg Ser Thr 
180 185 190 

Gly Glu Ser Tyr Ala Val Lys He He Asn Arg Arg Lys Ala Leu Asn 
195 200 205 

Thr Gly Gly Gly Ser Ala Met Ala Gly Val Asp Arg Glu Leu Ser He 
210 215 220 

Leu Glu Arg Leu Asn His Pro Asn He Val Ala Leu Lys Ala Phe Tyr 
225 230 235 240 

Glu Asp Met Asp Asn Tyr Tyr He Val Met Glu Leu Val Pro Gly Gly 
245 250 / 255 

Asp Leu Met Asp Phe Val Ala Ala Asn Gly Ala He Gly Glu Asp Ala 
260 265 270 

Thr Gin Val He Thr Lys Gin He Leu Glu Gly He Ala Tyr Val His 
275 280 285 

Asn Leu Gly He Ser His Arg Asp Leu Lys Pro Asp Asn He Leu He 
290 295 300 

Met Gin Asp Asp Pro He Leu Val Lys He Thr Asp Phe Gly Leu Ala 
305 310 315 320 

Lys Phe Ser Asp Asn Ser Thr Phe Met Lys Thr Phe Cys Gly Thr Leu 
325 330 335 

Ala Tyr Val Ala Pro Glu Val He Thr Gly Lys Tyr Gly Ser Ser Gin 
340 345 350 

Met Glu Ser Gin Gin Lys Asp Asn Tyr Ser Ser Leu Val Asp He Trp 
355 360 365 

Ser Leu Gly Cys Leu Val Tyr Val Leu Leu Thr Ser His Leu Pro Phe 
370 375 380 

Asn Gly Lys Asn Gin Gin Gin Met Phe Ala Lys He Lys Arg Gly Glu 
385 390 395 400 

Phe His Glu Ala Pro Leu Asn Ser Tyr Asp He Ser Glu Asp Gly Arg 
405 410 415 



Asp Phe Leu Gin Cys Cys Leu Gin Val Asn Pro Lys Leu Arg Met Thr 
420 425 430 
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Ala Ala Glu Ala Leu Lys His Lys Trp Leu Gin Asp Leu Tyr Glu Glu 
435 440 445 

Asp Ser Val Lys Ser Leu Ser Leu Ser Gin Ser Gin Ser Gin Gin Ser 
450 455 460 

Arg Lys lie Asp Asn Gly lie His lie Glu Ser Leu Ser Lys lie Asp 
465 470 475 480 

Glu Asp Val Met Leu Arg Pro Leu Asp Ser Glu Arg Asn Arg Lys Ser 
485 490 495 

Ser Lys Gin Gin Asp Phe Lys Val Pro Lys Arg Val lie Pro Leu Ser 
500 505 510 

Gin His Pro Ala Thr Pro Leu Pro Met Ser Gin Pro Lys Lys Arg Pro 
515 520 525 

Tyr Gin lie Asp Pro Arg Thr Asn Lys Lys Val Asp Leu Glu Glu Pro 
530 535 540 

Ser Thr Ser Lys Lys Val Lys Leu Ser Asp Ser Val Val Ala Glu Asp 
545 550 555 560 

Tyr Leu Lys Leu Gly Pro Leu Ala Asn Ser Leu Phe Gin Glu Thr lie 
565 570 575 

Asn lie Ser Lys Ser Pro Phe Ser Phe Gly Arg Asn Asp Thr Cys Asp 
580 585 590 

Cys Glu lie Asp Asp Asp Arg Leu Ser Lys Leu His Cys Val lie Thr 
595 600 605 

Lys Glu Asn Asp Ser lie Trp Leu Leu Asp Lys Ser Thr Asn Ser Cys 
610 615 620 

Leu Val Asn Asn Thr Ser Val Gly Lys Gly Asn Lys Val Leu Leu Arg 
625 630 635 640 

Gly Gly Glu lie Leu His Leu Phe Phe Asp Pro Leu Ser Ser Gin His 
645 650 655 

He Gly Phe Lys Val Val Leu Val Asp Gin Ser Ser Gly Glu His Lys 
660 665 670 

Ser Gin Val Glu Val Leu Lys Gin Thr Ser Glu Glu Met Asn He He 
675 680 685 
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Pro Leu lie Ser Gly Leu Ser Ser He Ser Ser 
690 695 



<210> 13 
<211> 295 
<212> PRT 

<213> Candida albicans 
<400> 13 

Met Gly Thr Ser Thr Ser Glu Ala Leu Lys Asn He Lys Asn Lys Gin 
15 10 15 

Arg Arg Gin Lys Val ?he Ala Glu He Lys His Glu Lys Asn- Lys Gin 
20 25 30 

Arg His Lys Gin Arg Ala Glu Arg Ala Lys Glu Glu Arg Glu Asn Pro 
35 40 45 

Glu Leu Arg Glu Glu Arg He Ala Ala Asn He Pro Asp Thr He Aso 
50 55 60 

Ser Lys Arg He Tyr Asp Glu Thr He Ala Ala Glu Val Glu Gly Asp 
65 70 75 80 

Asp Glu Phe Gin Ser Tyr Phe Thr Asn Leu Leu Glu Glu Pro Lys He 
85 90 95 

Leu Leu Thr Thr Ser Ala Asn^ Ala Lys Lys Pro Ala Tyr Glu Phe Ala 
100 105 110 

Asp Met He Met Asp Phe Leu Pro Asn Val Thr Phe He Lys Arg Lys 
115 120 125 

Lys Glu Tyr Thr Met Gin Asp Met Ala Lys Tyr Cys Ser Asn Arg Asp 
130 135 140 

Phe Thr Ala Leu Leu Val He Asn Glu Asp Lys Lys Lys Val Asn Gly 
145 150 155 160 

He Thr Leu He Asn Leu Pro Glu Gly Pro Thr Phe Tyr Phe Ser He 
165 170 175 

Thr Ser He Val Asp Gly Lys Arg He Lys Gly His Gly Lys Ala Gly 
180 185 190 

Asp Tyr Leu Pro Glu He Val Leu Asn Asn Phe Asn Ser Arg Leu Gly 
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195 200 

Lys Thr Val Gly Arg Leu Phe Gin 
210 215 

Leu Gin Gly Arg Gin Val He Thr 
225 230 

Phe Phe Arg Arg His Arg Tyr He 
245 

Leu Gin Giu Gly Pro Gin Phe Thr 
260 

Gly Val Arg Gly Asp Val Val Trp 
275 280 

Asp Lys Lys Lys Phe Tyr Leu 
290 295 



205 

Ser He Phe Pro His Lys Pro Glu 
220 

Leu His Asn Gin Arg Asp Tyr He 
235 "^'240 

Phe Arg Asn Glu Glu Lys Val Gly 
250 255 

Leu Lys Leu Arg Arg Met Gin Lys 
265 270 

Glu His Arg Pro Asp Met Glu Arg 
285 



•,210> 14 
<211> 55 
<212> PRT 

<213> Candida albicans 
<400> 14 

Met Asn Ser Glu Lys He He Glu Val He lie Ala He Phe Leu Pro 
15 10 15 

Pro Val Ala Val Phe Met Lys Cys Gly Ala Thr Thr Pro Leu Tr? He 
20 25 30 

Asn Leu Val Leu Cys He Phe He Trp Phe Pro Ala He Leu His Ala 
35 40 45 

Leu Tyr Val Val Leu Lys Asp 
50 55 



<210> 15 
<211> 773 
<212> PRT 

<213> Candida albicans 
<400> 15 

Met Thr Leu Gly Phe .-.sp Lys Phe He Ser Lys Val Ser Thr His Arg 
15 10 15 
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Arg Gin Ser Glu Pro 
20 

Asn Lys Ser Arg Arg 
35 

Glu Ser Thr Asn Asn 
50 

Arg Gin Arg Ala Gly 
65 

His Gin Asn Ser His 
85 

Gly Ala Arg He Ser 
100 

Gly Glu Ala He Ser 
115 

Leu Val Lys Gin Glu 
130 

Met Leu Gin Asp Leu 
145 

Ser Gin Gly Ser Glu 
165 

Asn Asn Cys He Tyr 
180 

Glu Asp Asp Glu Asn 
195 

Glu Met Pro Thr Ala 
210 

Asn His Ser Glu Thr Asp 
225 230 

Phe Ser Lys Met Lys Ser 
245 

He Asp Ser Glu Cys Pro 
260 



Ser He Leu Glu He Ala 
25 

Leu Ser Met Asp Asn Gly 
40 

His His His Leu Asn Thr 
55 

Ser Phe Ser Phe He Ser 
70 75 

Asp Asn Tyr Thr Val Asp 
90 

Arg Ser Arg Ser Arg Ser 
105 

Pro Ala Tyr Phe Ser Lys 
120 

Thr Ala His He He Ser 
135 

Asp Leu Gin Asn Pro He 
150 155 

Ser Lys Phe Cys Lys He 

^ ^ 170 

Leu Pro Ala Ala Ser Ser 
185 

Gly Gly Val He He Ala 
200 

Val Asn Asn Asn Thr Leu 
215 

Phe Ser Asp Ser Pro 
235 

Phe His Ser Pro Asn 
250 

He Pro His Thr Phe 
265 



Ala Thr Asn Ser Gin 
30 

His Cys Tyr Val Arg 
45 

Val Val Asp Asn Leu 
60 

His His His Asn His 
80 

Pro Leu Thr Ser Asn 
95 

Lys Ser Val Gly His 
110 

Asn Lys Thr Lys Asp 
125 

Lys Lys Leu Leu Asn 
140 

Ala Leu Lys Thr He 
160 

Tyr Val Ser Asn Thr 
175 

Thr Ser Phe Thr Tyr 
190 

Glu Asp Arg Asn Asp 
205 

Ser Met Asp Ser He 
220 

Pro Pro Pro Asp Leu 
240 

Tyr Leu Thr Ser Lys 
255 

Ala Val He Val Glu 
270 
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Leu Thr Lys Asp Ser Leu lie He Lys Asp Leu His Phe Gin Phe Gin 
275 280 285 

Ser Leu Thr Thr He Leu Trp Pro Thr Gly Asp Ala Tyr Asn Arg Thr 
290 295 300 

His Ala Lys Glu Lys Phe Thr He Gly Asn Met Glu Trp Arg Thr Ser 
305 310 315 320 

Leu Ser Asp Ala Asp Tyr Tyr He Asn Ser Ser Asn Ser Asn Asp Val 
325 330 335 

Lys Ser Lys Asn Leu Gly Pro Glu Asp Leu He Asn Arg Thr Arg Glu 
340 345 350 

Tyr Lys Leu He Asp He Glu Glu Pro Asn Asn Ser Ser Asn Ser Leu 
355 360 365 

Ser Asp Asp Asp Met Asp He Asn Asn He Thr Ser Pro Leu Ser Thr 
370 375 380 

Ser Pro Thr Ser Ser Ser Thr Ser Thr Asn Ser Thr Ser Asn Ser Leu 
385 390 395 400 

Gly Ser Asp Ser Tyr Lys Ala Gly Leu Tyr Val Phe Leu Leu Pro He 
405 410 415 

Leu Leu Pro Glu His He Pro Ala Ser He Val Ser He Asn Gly Ser 
420 425 430 

Leu Ala His Thr Leu Ser Val Glu Cys Asn Lys Tyr Thr Asp Lys Leu 
435 440 445 

Asn Arg Lys Ser Lys Val Ser Ala Ser Tyr Asn Leu Pro Met Val Arg 
450 455 460 

Thr Pro Pro Asn He Gly Asn Ser He Ala Asp Lys Pro He Tyr Val 
465 470 475 480 

Asn Arg He Trp Asn Asp Ala Val Kis Tyr He He Thr Phe Pro Arg 
485 490 495 

Lys Tyr Val Thr Leu Gly Cys Glu His Met He Asn Val Lys Leu Ser 
500 505 510 



Pro Met Val Lys Asp Val Val He Lys Arg He Lys Phe Asn Val Leu 
515 520 525 



14 



wo 00/34481 PCT/EP99/09833 



Glu Arg lie Thr Tyr Val Ser Lys Asn Leu Ser Arg Glu Tyr Asp Tyr 
530 535 540 

Asp Ser Glu Asp Pro Tyr Cys He His Pro Val Ser Lys Glu Asn Lys 
545 550 555 560 

Val Arg Glu Arg Val Val Ser Leu Tyr Glu Leu Lys Thr Lys Ala Lys 
565 570 575 

Gin Ser Ser Gly Gly His Leu Glu Ala Tyr Lys Gin Glu Val Met Lys 
580 585 590 

Cys Pro Glu Asn Asn Leu Leu Phe Ser Cys Tyr Glu Val Glu Asn Asp 
595 600 605 

Asn Asn Asn Gly Asn Gly Asn Gly Asn Gly Asn Gly Asn Lys Asn Val 
610 615 620 

Lys Gin Lys Asn Lys Asp Gin Pro Met He Ala Thr Pro Leu Asp He 
625 630 635 640 

Asn Val Ser Leu Pro Phe Leu Thr Thr Met Ser Asp Ser Leu He Met 
645 650 655 

Thr Ser Ala He Glu Glu Glu Gly Ser Asp Ser Pro His Thr Ser Arg 
660 665 670 

Arg Gly Ser Ala Val Ser Met Thr Asp Asn Asn Thr Thr Pro Ser Asn 
675 680 685 

Asn Asn Pro Leu Ser Pro Phe Leu Gly Ala Val Glu Thr Asn Gly Ala 
690 695 700 

Ser He Asn Glu He Gly Asp His Thr Leu Phe Pro Asp Ser Asn Phe 
705 710 715 720 

Arg His He Glu He Lys His Arg Leu Gin Val Thr Phe Arg He Ser 
725 730 735 

Lys Pro Asp Ser Asp Asn Lys Met His His Tyr Glu Val Val He Asp 
740 745 750 

Thr Pro He Val Leu Leu Ser Ser Lys Cys Gin Glu Asp Ser Pro Pro 
755 760 765 

Pro Tyr Ser Ser Val 
770 
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<210> 16 
<211> 90 
<212> PRT 

<213> Candida albicans 
<400> 16 

Met Gly Glu Gly Thr Pro Ser Leu Gly Lys Arg His Asn Lys Ser His 
15 10 15 

Thr Leu Cys Asn Arg Cys Gly Arg Arg Ser Phe His Val Gin Lys Lys 
20 25 30 

Thr Cys Ser Ser Cys Gly Tyr Pro Ala Ala Lys Met Arg Sej? His Asn 
35 40 45 

Trp Ala Leu Lys Ala Lys Arg Arg Arg Thr Thr Gly Thr Gly Arg Met 
50 55 60 

Ala Tyr Leu Lys His Val Thr Arg Arg Phe Lys Asn Gly Phe Gin Thr 
65 -0 75 80 

Gly Val Ala Lys Ala Gin Thr Pro Ser Ala 
85 90 



16 



